Pinned Repositories
AMS
amplitude modulation spectrum
BasicTools
collections of basic tools
Bianural-cues
python code for binaural cues(ITD,ILD) calculation
Gammatone-filters
Python implementation of Gammatone filter
GCC-PHAT_DNN_Loc
DNN based binaural sound localization model, using GCC-PHAT as features
MVDR-beamformer
MVDR beamformer written in python
RASTA-PLP
Relative Spectral Transform-Perceptual Linear Prediction
Roomsim_Campbell
Roomsim_Campbell
RoomSimulator
WaveLoc
End-to-End binaural sound localization
bingo-todd's Repositories
bingo-todd/Gammatone-filters
Python implementation of Gammatone filter
bingo-todd/GCC-PHAT_DNN_Loc
DNN based binaural sound localization model, using GCC-PHAT as features
bingo-todd/WaveLoc
End-to-End binaural sound localization
bingo-todd/MVDR-beamformer
MVDR beamformer written in python
bingo-todd/BasicTools
collections of basic tools
bingo-todd/Roomsim_Campbell
Roomsim_Campbell
bingo-todd/Bianural-cues
python code for binaural cues(ITD,ILD) calculation
bingo-todd/RoomSimulator
bingo-todd/GMM_Localize
bingo-todd/GMMs
Gaussian Mixture Mode written in Python
bingo-todd/sparse-NMF
bingo-todd/awesome-english-ebooks
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
bingo-todd/bi-lstm-crf-ner-tf2.0
Named Entity Recognition (NER) task using Bi-LSTM-CRF model implemented in Tensorflow 2.0(tensorflow2.0 +)
bingo-todd/BinauralLocalizationCNN
Code to create networks that localize sounds sources in 3D environments
bingo-todd/bingo-todd.github.io
pages
bingo-todd/BlindEstRT
bingo-todd/BlindRT
Blind Estimation of Reveberation Time from speech and music
bingo-todd/Conv-Tasnet-for-speech-enchancement-and-seperation
The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation
bingo-todd/DNN_binaural_localization
binaural sound localization, DNN model with feature of GCC-PHAT
bingo-todd/FDY-SED
bingo-todd/ICASSP-latex-template
ICASSP latex template (2021)
bingo-todd/jekyll-theme-prologue
A Jekyll version of the "Prologue" theme by HTML5 UP
bingo-todd/LocTools
bingo-todd/magenta
Magenta: Music and Art Generation with Machine Intelligence
bingo-todd/preprint-template.tex
A template for two-column scientific preprints
bingo-todd/pySOFA
Python API for SOFA (Spatially Oriented Format for Acoustics)
bingo-todd/rir_simulator_python
Room impulse response simulator using python
bingo-todd/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
bingo-todd/tcnse
TCN-based Speech Enhancement
bingo-todd/WaveLoc_EC
An end-to-end binaural sound localization model based on the equalization and cancellation theory