Collection of datasets related to ML, AI, NLP and DL
- ACOUSTICBRAINZ: Crowdsourced acoustic information of songs available under open licenses.
- FREESOUND DATASETS: Platform for the collaborative creation of open audio collections from Freesound.
- DUNYA: Music corpora of several non-western music repertoires and related software tools.
- COMPMUSIC datasets: Collection of datasets of several non-western music repertoires.
- REPOVIZZ: Data repository and visualization tool for music performance multi-modal recordings.
- DREANSS: Annotations of drum events within known music audio recordings datasets.
- EEP: Multimodal recordings of string quartet performances.
- FLABASE: Knowledge Base of flamenco music.
- GIANTSTEPS Key: Key annotations of a music audio collection.
- GIANTSTEPS Tempo: Tempo annotations of a music audio collection.
- GOOD-SOUNDS: Recordings of single notes and scales played by several instruments.
- IRMAS: Musical audio excerpts with annotations of the predominant instruments.
- Last.fm Dataset 360k users - Last.fm Dataset 1k users: <user, artist-mbid, artist-name, total-plays> tuples from Last.fm.
- MARD: Text and accompanying metadata of Amazon customer reviews.
- MASS: Multi-track recordings for audio source separation research.
- MTG-QBH: Recordings of sung melodies for Query-by-Humming research.
- ORCHSET: Orchestral music excerpts with annotations for melody extraction research.
- PHENICX-Anechoic: Denoised recordings and note annotations for Aalto anechoic orchestral database.
- PHENICX-emotion: Excerpts of the Eroica Symphony by Beethoven plus audio descriptors.
- QUARTET: Multimodal data of string quartet performances.
- SAS: List of artists and biographical information for semantic artist similarity research.
- TONAS: Flamenco a cappella sung melodies with manual transcriptions.
- Haydn Quartets: Scores and harmonic annotations of Haydn's String Quartets Op. 20.