bingo-todd

Beijing, China

Pinned Repositories

AMS
amplitude modulation spectrum
Language:Python11
BasicTools
collections of basic tools
Language:Python7 2 00
Bianural-cues
python code for binaural cues(ITD,ILD) calculation
Language:Jupyter Notebook40
Gammatone-filters
Python implementation of Gammatone filter
Language:Python24 3 13
GCC-PHAT_DNN_Loc
DNN based binaural sound localization model, using GCC-PHAT as features
Language:Python18 2 36
MVDR-beamformer
MVDR beamformer written in python
Language:Python92
RASTA-PLP
Relative Spectral Transform-Perceptual Linear Prediction
Language:Python6 1 00
Roomsim_Campbell
Roomsim_Campbell
Language:MATLAB7 2 03
RoomSimulator
Language:Python2 1 00
WaveLoc
End-to-End binaural sound localization
Language:Python142

bingo-todd's Repositories

bingo-todd/Gammatone-filters
Python implementation of Gammatone filter
Language:Python24 3 13
bingo-todd/GCC-PHAT_DNN_Loc
DNN based binaural sound localization model, using GCC-PHAT as features
Language:Python18 2 36
bingo-todd/WaveLoc
End-to-End binaural sound localization
Language:Python142
bingo-todd/MVDR-beamformer
MVDR beamformer written in python
Language:Python92
bingo-todd/BasicTools
collections of basic tools
Language:Python7 2 00
bingo-todd/Roomsim_Campbell
Roomsim_Campbell
Language:MATLAB7 2 03
bingo-todd/Bianural-cues
python code for binaural cues(ITD,ILD) calculation
Language:Jupyter Notebook40
bingo-todd/RoomSimulator
Language:Python2 1 00
bingo-todd/GMM_Localize
Language:Python1
bingo-todd/GMMs
Gaussian Mixture Mode written in Python
Language:Python1
bingo-todd/sparse-NMF
Language:Python1
bingo-todd/awesome-english-ebooks
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
bingo-todd/bi-lstm-crf-ner-tf2.0
Named Entity Recognition (NER) task using Bi-LSTM-CRF model implemented in Tensorflow 2.0(tensorflow2.0 +)
bingo-todd/BinauralLocalizationCNN
Code to create networks that localize sounds sources in 3D environments
Language:Python1 01
bingo-todd/bingo-todd.github.io
pages
Language:HTML
bingo-todd/BlindEstRT
Language:Python1
bingo-todd/BlindRT
Blind Estimation of Reveberation Time from speech and music
Language:MATLAB1
bingo-todd/Conv-Tasnet-for-speech-enchancement-and-seperation
The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation
bingo-todd/DNN_binaural_localization
binaural sound localization, DNN model with feature of GCC-PHAT
bingo-todd/FDY-SED
bingo-todd/ICASSP-latex-template
ICASSP latex template (2021)
Language:TeX
bingo-todd/jekyll-theme-prologue
A Jekyll version of the "Prologue" theme by HTML5 UP
bingo-todd/LocTools
Language:Python
bingo-todd/magenta
Magenta: Music and Art Generation with Machine Intelligence
bingo-todd/preprint-template.tex
A template for two-column scientific preprints
Language:TeX1 0
bingo-todd/pySOFA
Python API for SOFA (Spatially Oriented Format for Acoustics)
Language:Python
bingo-todd/rir_simulator_python
Room impulse response simulator using python
Language:Python
bingo-todd/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
bingo-todd/tcnse
TCN-based Speech Enhancement
bingo-todd/WaveLoc_EC
An end-to-end binaural sound localization model based on the equalization and cancellation theory
Language:Python1