/FSD50K_baseline

Baseline systems for the FSD50K dataset

MIT LicenseMIT

FSD50K_baseline

This repository will contain the code for the baseline experiments included in the following paper. If you use this code or part of it, please cite:

Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra, "FSD50K: an Open Dataset of Human-Labeled Sound Events", arXiv:2010.00475, 2020.

This repository will contain a framework that comprises all the basic stages in supervised sound event classifcation: feature extraction, training, inference and evaluation. After loading the FSD50K dataset, log-mel energies are computed and several baselines can be trained and evaluated. Please check our paper for more details. The system is implemented in TensorFlow.

Code will be made available in the final version of the paper, hopefully before the end of 2021. We will announce when it is available via twitter and via the freesound-annotator Google Group.

In the meantime, make sure to take a look at the resources we just released:

Stay tuned!

Reference

@article{fonseca2020fsd50k,
  title={{FSD50K}: an Open Dataset of Human-Labeled Sound Events},
  author={Fonseca, Eduardo and Favory, Xavier and Pons, Jordi and Font, Frederic and Serra, Xavier},
  journal={arXiv preprint arXiv:2010.00475},
  year={2020}
}