OmniBench: Towards The Future of Universal Omni-Language Models Paper • 2409.15272 • Published Sep 23 • 25
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response Paper • 2309.08730 • Published Sep 15, 2023 • 1
MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning Paper • 2310.09853 • Published Oct 15, 2023
The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation Paper • 2311.10057 • Published Nov 16, 2023 • 1
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training Paper • 2306.00107 • Published May 31, 2023 • 3
Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities Paper • 2312.00249 • Published Nov 30, 2023 • 1
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT Paper • 2306.17103 • Published Jun 29, 2023 • 1
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25 • 56
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Paper • 2405.19327 • Published May 29 • 46
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation Paper • 2407.04822 • Published Jul 5 • 1
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models Paper • 2408.01337 • Published Aug 2 • 10