/mtl_sed_asc

Joint analysis of sound events and acoustic scenes based on multitask learning

Primary LanguagePython

Joint Analysis of Sound Events and Acoustic Scenes Based on Multitask Learning

Sound event detection (SED) and acoustic scene classification (ASC) are major research tasks in environmental sound analysis. Conventional methods have addressed these tasks separately; however, acoustic events and scenes are closely related to each other. For example, in the acoustic scene "office," the sound events "mouse clicking" and "keyboard typing" tend to occur. This repository provides the implementation for joint analysis of sound events and acoustic scenes, which have been applied in [1][2].

See also this repository.

[1] N. Tonami, K. Imoto, M. Niitsuma, R. Yamanishi, and Y. Yamashita, "Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning," Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 333-337, 2019.
[2] N. Tonami, K. Imoto, R. Yamanishi, and Y. Yamashita, "Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning," IEICE Transactions on Information and Systems, Vol. E104-D, No. 02, pp. 294-301, 2021.