/AV-SELD

Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"

Primary LanguagePythonMIT LicenseMIT

Watchers