Pinned Repositories
FFDConv
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
MFF-EINV2
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
MTDA
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Harper812's Repositories
Harper812/FFDConv
Harper812/MFF-EINV2
Harper812/MTDA