sound-source-localization
There are 26 repositories under sound-source-localization topic.
aishoot/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Audio-WestlakeU/RealMAN
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
BrownsugarZeer/Multi_SSL
Combine sound source separation with SRP-PHAT to achieve multi-source localization.
BingYang-20/SRP-DNN
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
BingYang-20/DP-RTF-Learning
A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
ishaaniwani/GCC-PHAT-SSL
MATLAB Simulation Framework For Basic Sound Source Localization Using the GCC PHAT Algorithm
axeber01/wav2pos
3D Sound Source Localization using Masked Autoencoders
RobertoAlessandri/CNN_DOA
Test of the ability of a Convolutional Neural Network (CNN) trained to localize the Direction Of Arrival (DOA), to generalize in different environments.
stoneMo/OneAVM
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
ZahraBenslimane/sound_source_localization_with_beamforming
Localization of a sound source using a microphone array and beamforming technics
BingYang-20/TF-Wise-Spatial-Spectrum-Clustering
A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]
sutdcv/Chaotic-World
[ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events
linfeng-feng/Unbiased_Label_Distribution
Eliminating Quantization Errors in Classification-Based Sound Source Localization
wattai/sound-source-position-estimation
This scripts estimate Sound Source Position based on Cross-power Spectrum Phase (CSP) or Multiple Signal Classification (MUSIC).
ly-zhu/Leveraging-Category-Information-for-Single-Frame-Visual-Sound-Source-Separation
PyTorch implementation of "Leveraging Category Information for Single-Frame Visual Sound Source Separation"
ly-zhu/cof-net
Code for the paper: Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Gl0dny/hexapod
This project develops an autonomous hexapod robot using auditory scene analysis for navigation. It integrates sound source localization (DOA) and beamforming via ODAS with a circular microphone array for precise spatial detection. A machine learning-based Keyword Spotting (KWS) module enables voice command recognition for human-robot interaction.
ly-zhu/ly-zhu.github.io
Projects webpage
ishaaniwani/SpeechProcessor
Program that takes multiple wav files and processes them so that they can be recognized.
dasdristanta13/2.5D-Visual-Sound
Visualising Sound
MaloOLIVIER/hungarian-net
Hungarian Network 🔬 — Generate synthetic data and train your deep-learning implementation of the Hungarian algorithm.
yhzh05/DFLNet
Code
cid2rrrr/BAaV
Official Codebase of "Bridging Audio and Vision: Zero-Shot Audiovisual Segmentation by Connecting Pretrained Models" (Interspeech 2025)
nwpugq/sound-source-localization
sound-source-localization