ESC50 with TensorFlow

The ESC50 dataset is a small audio dataset of 2000 samples, each 5 seconds long. It is available from GitHub as raw audio (WAV files).

This repository contains a small script that reads the audio files and writes them to TFRecords, TensorFlow's format for efficient data storage and streaming. It also contains sample code that reads the data from the TFRecords back into a dataset.
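The round trip described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: the feature keys "audio" and "label", the helper names, and the choice to store each clip as encoded WAV bytes are all assumptions made for the example.

```python
import tensorflow as tf

# Assumed feature layout: one encoded WAV file plus one integer class label.
FEATURE_SPEC = {
    "audio": tf.io.FixedLenFeature([], tf.string),
    "label": tf.io.FixedLenFeature([], tf.int64),
}


def serialize_clip(wav_bytes, label):
    """Pack one clip (raw WAV bytes) and its label into a serialized tf.train.Example."""
    feature = {
        "audio": tf.train.Feature(bytes_list=tf.train.BytesList(value=[wav_bytes])),
        "label": tf.train.Feature(int64_list=tf.train.Int64List(value=[label])),
    }
    example = tf.train.Example(features=tf.train.Features(feature=feature))
    return example.SerializeToString()


def write_records(clips, out_path):
    """Write an iterable of (wav_bytes, label) pairs to a single TFRecord file."""
    with tf.io.TFRecordWriter(out_path) as writer:
        for wav_bytes, label in clips:
            writer.write(serialize_clip(wav_bytes, label))


def parse_clip(serialized):
    """Decode one serialized example into a mono float32 waveform and its label."""
    parsed = tf.io.parse_single_example(serialized, FEATURE_SPEC)
    waveform, _sample_rate = tf.audio.decode_wav(parsed["audio"], desired_channels=1)
    return tf.squeeze(waveform, axis=-1), parsed["label"]


def load_dataset(record_paths):
    """Build a tf.data.Dataset that yields (waveform, label) pairs."""
    return tf.data.TFRecordDataset(record_paths).map(parse_clip)
```

Storing the encoded bytes keeps the writer simple and defers decoding to read time; the actual script may instead store decoded samples or additional metadata.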

Download the dataset from GitHub (see above), then run the builder script with the following arguments:

--dataset_dir pointing to the dataset's audio directory

--out_dir pointing to a directory to store the TFRecord files

like so: python path/to/script.py --dataset_dir path/to/dir --out_dir path/to/dir
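For reference, the two arguments could be parsed along these lines. This is only a sketch of the command-line interface described above; the function name parse_args, the help strings, and marking both flags as required are assumptions, and the real script may differ.

```python
import argparse


def parse_args(argv=None):
    """Parse the two directory arguments; argv=None falls back to sys.argv."""
    parser = argparse.ArgumentParser(
        description="Convert ESC50 audio files to TFRecords."
    )
    parser.add_argument(
        "--dataset_dir",
        required=True,  # assumption: the script cannot run without it
        help="Path to the dataset's audio directory.",
    )
    parser.add_argument(
        "--out_dir",
        required=True,  # assumption: no default output location
        help="Directory in which to store the TFRecord files.",
    )
    return parser.parse_args(argv)
```

Calling parse_args(["--dataset_dir", "audio", "--out_dir", "records"]) returns a namespace whose dataset_dir and out_dir attributes hold the two paths.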