facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
PythonNOASSERTION
Stargazers
- adiyossHUJI & FAIR
- aitalk
- akansal1
- atsushiKojima
- breizhnOldenburg
- charlesliucnTsinghua University
- csteinmetz1@suno-ai
- denisfitz57
- doesdev@xelexdigital
- faroitAudioshake
- felixkreukIsrael
- fly51flyPRIS
- ggsonic
- GitHub30Osaka, Japan
- hwong39
- hysiosChina, Changsha
- imbibekkSeoul
- Joel-hanson@ibm
- krmiddlebrookSan Diego
- LearnedVector
- lili-0805Nagoya University
- loretoparisi@Musixmatchdev
- mkachlickaLondon, UK
- numb3r3@jina-ai
- pbayliesDurham, NC
- pranaymanocha
- qmpzzpmqUIM
- railsloes
- rudygtremind.me
- rylokhande
- shoegazerstella@musixmatch @musixmatchresearch
- shyamsn97
- simonefrancia@musixmatch @Musixmatchdev
- sudhamstarunBalyasny Asset Management L.P.
- tuan3wHanoi
- vd-v@LLNL