Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
Primary LanguagePythonMIT LicenseMIT