Call for Challenges: AES International Conference on AI and Machine Learning for Audio (AIMLA 2025)

The AES International Conference on Artificial Intelligence and Machine Learning for Audio (AIMLA 2025), hosted at the Centre for Digital Music of Queen Mary University of London and taking place on Sept. 8-10, 2025, is calling for proposal submissions for Challenges.

The conference promotes knowledge sharing among researchers, professionals, and engineers in AI and audio. Special Sessions include pre-conference challenges hosted by industry or academic teams to drive technology improvements and explore new research directions. Each team manages the organization, data provision, participation instructions, mentoring, scoring, summaries, and results presentation.

Challenges are selected based on their scientific and technological significance, data quality and relevance, and proposal feasibility. Collaborative proposals from different labs are encouraged and prioritized. We expect an initial expression of interest via email to special-sessions-aimla@qmul.ac.uk by October 15, 2024, followed by a full submission on EasyChair by the final submission deadline.

For more information on the Calls for Papers, Special Sessions, Tutorials, and Challenges, please visit the conference website: https://aes2.org/events-calendar/2025-aes-international-conference-on-artificial-intelligence-and-machine-learning-for-audio/


AIM at Interspeech 2024

From 31 August to 5 September, AIM PhD students will participate in Interspeech 2024 and its satellite events. Interspeech is the premier international conference for research on the science and technology of spoken language processing.

Chin-Yun Yu will present his paper “Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis”. The paper improves the GOLF voice synthesizer by filtering noise and harmonics jointly with a single LP (Linear Prediction) filter, resembling a classic source-filter model, and by replacing frame-wise approximation with sample-by-sample LP processing, implemented efficiently in C++ and CUDA. These modifications result in smoother spectral envelopes, reduced artefacts, and improved performance in listening tests compared to baseline methods. More information can be found here.
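For readers unfamiliar with LP synthesis, the sketch below illustrates the general idea of applying a time-varying all-pole (LP) filter sample by sample, which is the kind of recursion the paper implements efficiently in C++ and CUDA. This is a simplified NumPy illustration with hypothetical function and variable names, not the paper's actual implementation.

```python
import numpy as np

def time_varying_lp_synthesis(excitation, lp_coeffs):
    """Apply a time-varying all-pole (LP) synthesis filter sample by sample.

    excitation : (T,) source signal (e.g. glottal pulses plus noise)
    lp_coeffs  : (T, P) LP coefficients a_1..a_P for each output sample
    Returns y with y[n] = excitation[n] - sum_k a_k[n] * y[n-k].
    """
    T, P = lp_coeffs.shape
    y = np.zeros(T)
    for n in range(T):
        # Previous output samples y[n-1], y[n-2], ... (fewer at the start).
        past = y[max(0, n - P):n][::-1]
        y[n] = excitation[n] - np.dot(lp_coeffs[n, :len(past)], past)
    return y

# Illustrative usage: white-noise excitation through a fixed 2nd-order filter.
rng = np.random.default_rng(0)
excitation = rng.standard_normal(16000)
coeffs = np.tile(np.array([-0.9, 0.2]), (16000, 1))
audio = time_varying_lp_synthesis(excitation, coeffs)
```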

Farida Yusuf is part of the programme committee for the Young Female Researchers in Speech Workshop (YFRSW). The workshop is designed for Bachelor’s and Master’s students currently engaged in speech science and technology research, aiming to promote interest in the field among those who haven’t yet committed to pursuing a PhD. It features panel discussions, student poster presentations, and mentoring sessions, providing participants with opportunities to showcase their research and engage with PhD students and senior researchers in the field.

See you at Interspeech!


AIM at NIME 2024

From 2 to 6 September, AIM PhD students Jordie Shier, Shuoyang Zeng, and Teresa Pelinski will be at the International Conference on New Interfaces for Musical Expression (NIME), taking place in Utrecht, the Netherlands.

Jordie will present his paper Real-time Timbre Remapping with Differentiable DSP, written in collaboration with Charalampos Saitis (C4DM, QMUL), Andrew Robertson (Ableton) and Andrew McPherson (Imperial College London). The paper presents a method for mapping audio from percussion instruments onto synthesiser controls in real time using neural networks, enabling nuanced, audio-driven timbral control of a musical synthesiser. You can read the paper here and check the project website and presentation here.
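As a rough illustration of what such a feature-to-parameter mapping can look like, here is a minimal PyTorch sketch in which a few per-frame audio descriptors are passed through a small network that outputs normalised synthesiser parameters. The architecture, feature choices, and names are illustrative assumptions, not taken from the paper, which uses differentiable DSP to learn the mapping.

```python
import torch
import torch.nn as nn

class TimbreRemapper(nn.Module):
    """Small MLP mapping per-frame audio features to synthesiser parameters.

    Schematic only: a short frame of percussion audio is summarised by a few
    descriptors, which are mapped to synth controls in [0, 1].
    """
    def __init__(self, n_features=4, n_params=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_features, 64),
            nn.ReLU(),
            nn.Linear(64, n_params),
            nn.Sigmoid(),  # keep parameters in a normalised range
        )

    def forward(self, features):
        return self.net(features)

def frame_features(frame: torch.Tensor) -> torch.Tensor:
    """Toy descriptors: RMS level, spectral centroid, spectral flatness, zero-crossing rate."""
    spec = torch.abs(torch.fft.rfft(frame))
    freqs = torch.linspace(0.0, 1.0, spec.numel())
    rms = frame.pow(2).mean().sqrt()
    centroid = (freqs * spec).sum() / (spec.sum() + 1e-8)
    flatness = torch.exp(torch.log(spec + 1e-8).mean()) / (spec.mean() + 1e-8)
    zcr = (frame[:-1] * frame[1:] < 0).float().mean()
    return torch.stack([rms, centroid, flatness, zcr])

# Illustrative usage on one frame of audio.
frame = torch.randn(512)
model = TimbreRemapper()
synth_params = model(frame_features(frame))
```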

Shuoyang will also present a paper, Building sketch-to-sound mapping with unsupervised feature extraction and interactive machine learning, written in collaboration with AIM PhD student Bleiz Del Sette, Charalampos Saitis (C4DM, QMUL), Anna Xambó (C4DM, QMUL) and Nick Bryan-Kinns (CCI, University of the Arts London). The paper explores interactive (personalised) construction of mappings between visual sketches and sound controls as an expressive approach to musical composition and performance.

Teresa will co-lead two workshops. The first workshop, First- and second-person perspectives for ML in NIME, has been organised in collaboration with Théo Jourdan (Sorbonne Université) and Hugo Scurto (independent artist and researcher). It focuses on autoethnographic methods for articulating insights and experiences surrounding new instrument building with AI – you can read more here. The second workshop, Building NIMEs with Embedded AI, has been organised in collaboration with Charles Patrick Martin (Australian National University). It is a hands-on tutorial on embedding lightweight deep learning models on Raspberry Pi and Bela – you can read more here.
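As a hint of the kind of workflow embedded deployment involves, a small model can be exported to TorchScript and later loaded from C++ (for example via LibTorch) on a Raspberry Pi or Bela board, without needing a Python runtime on the device. The snippet below is a generic illustration under those assumptions, not material from the workshop itself.

```python
import torch
import torch.nn as nn

# A deliberately tiny model, as might run comfortably on a Raspberry Pi or Bela.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))
model.eval()

# Trace and save as TorchScript so it can be loaded from C++ on the embedded device.
example_input = torch.randn(1, 16)
scripted = torch.jit.trace(model, example_input)
scripted.save("tiny_model.pt")
```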

See you at NIME!