Pinned Repositories
AV-SELD
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
Leveraging-Visual-Supervision-for-Array-based-ASDL
Code for "Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization"
TragicTakers_utils
Ready-to-use Python scripts to get started with the TragicTalkers dataset
dberghi's Repositories
dberghi/AV-SELD
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
dberghi/Leveraging-Visual-Supervision-for-Array-based-ASDL
Code for "Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization"
dberghi/TragicTakers_utils
Ready-to-use Python scripts to get started with the TragicTalkers dataset