[ISMIR 2023] Official code for the paper: LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT. Welcome to visit our m-a-p community and MARBLE benchmark for more details.
We introduce the first large-scale, weakly supervised, and copyright-free multilingual lyric transcription dataset, MulJam, consisting of 6,031 songs with 182,429 lines and a total duration of 381.9 hours. The dataset is placed under the ./MulJam
folder.
To install the dependencies, run the following command:
pip install -r requirements.txt
To transcribe lyrics using Whisper, run the following command:
sh code/run.sh
To post-process the Whisper output using ChatGPT, run the corresponding Python script in the ./code
folder:
MulJam v2.0 differs in two aspects:
- We have increased the test set size and collected refined human lyrics annotations, leading to adjustments in the data split.
- We have excluded the songs from the training and validation sets that were present in Jamendo. This exclusion enables a valid comparison with other lyrics transcription systems on the Jamendo dataset.
For a detailed description of MulJam v2.0, please refer to appendix E of the MARBLE benchmark paper.
The script for downloading and preprocessing audio will be available in the MARBLE benchmark repository.
We would like to give special thanks to the following researchers or musicians on the lyrics labelling: Léo Nebel in LIP6 at Sorbonne Université; Nick Magal in School of Music at Carnegie Mellon University; Carey Bunk, Yannis Vasilakis, Christopher Mitcheltree, Nelly Victoria Alexandra Garcia-Sihuay, Teresa Pelinski Ramos, Ilaria Manco, David Südholt, Jordan Shier, and Matthew Rice in Centre for Digital Music at Queen Mary University of London; Emilian Postolache in Sapienza University of Rome; as well as Wenqin Yu in the Chinese Music Institute at Peking University.
If you find this repository useful, please cite our paper and the MARBLE benchmark paper:
@article{zhuo2023lyricwhiz,
title={LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT},
author={Zhuo, Le and Yuan, Ruibin and Pan, Jiahao and Ma, Yinghao and LI, Yizhi and Zhang, Ge and Liu, Si and Dannenberg, Roger and Fu, Jie and Lin, Chenghua and others},
journal={arXiv preprint arXiv:2306.17103},
year={2023}
}
@article{yuan2023marble,
title={MARBLE: Music Audio Representation Benchmark for Universal Evaluation},
author={Yuan, Ruibin and Ma, Yinghao and Li, Yizhi and Zhang, Ge and Chen, Xingran and Yin, Hanzhi and Zhuo, Le and Liu, Yiqi and Huang, Jiawen and Tian, Zeyue and others},
journal={arXiv preprint arXiv:2306.10548},
year={2023}
}