DNLE: Dataset for Noise Level Estimation

The DNLE dataset contains 1668 environmental background noise recordings labeled according to type and level of noise. The recordings are approximately equally balanced between three main categories, i.e., "mechanic", "melodic", and "quiet". These noise categories were selected as we found in a previous study that these are the background noises that can distract users in crowdsourcing when performing tasks [1] [2]. The recordings were collected through the audio-web API employing different Windows and Mac computers.

The loudness measurements were 50.7dB(A) on average, varying from 30.6dB(A) to 81.3dB(A).

Noise Classes	Noise Category	Files per Category	Loudness Average (dBA)	min / max dBA
street, traffic dishwasher	mechanic	576	58.6	46.5 / 70.0
TV, TV-Show music, radio	melodic	580	59.6	32.0 / 81.3
quiet	quiet	512	31.4	30.6 / 33.0

Repository content

audio/*.wav

1668 audio recordings in WAV format (15.0 seconds long on average, 48.0 kHz, stereo) with the following naming convention:

{NOISE_TYPE} _ {TIMESTAMP} _ {NOISE_LEVEL} _ {ID}.wav
- {NOISE_TYPE} - type of environmental background noise,
- {NOISE_LEVEL} - noise level measured in dBA,
FeatureGenerator

Code to extract different spectral and chroma features from the audio files. The output is saved as *.csv. The code is optimized to run parallel.

LevelEstimator

Code to train and test the model based on a "Long Short-Term Memory" (LSTM) architecture in PyTorch.

Citing

If you find this dataset useful in an academic setting please cite: (to be updated)

Download

The dataset can be downloaded as a single .zip file (~2.38 GB):

Download DNLE dataset

References

[1] R. Zequeira Jiménez, B. Naderi, and S. Möller, "Background Environment Characteristics of Crowd-Workers from German Speaking Countries Experimental Survey on User Environment Characteristics," in 2019 Eleventh International Conference on Quality of Multimedia Experience (QoMEX), 2019, pp. 1–3. [DOI: https://doi.org/10.1109/QoMEX.2019.8743208]

[2] R. Zequeira Jiménez, B. Naderi, and S. Möller, "Effect of Environmental Noise in Speech Quality Assessment Studies using Crowdsourcing," in 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX), 2020, pp. 1–6. [DOI: https://doi.org/10.1109/QoMEX48832.2020.9123144]

zequeira/DNLE

DNLE: Dataset for Noise Level Estimation

Repository content

Citing

Download

References