Pinned Repositories
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In it, we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
fast-bert
Super easy library for BERT-based NLP models
hello-world
testing
lecidhugo.github.io
lecid blog
nbdev_template
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Resemblyzer
A Python package to analyze and compare voices with deep learning
lecidhugo's Repositories
lecidhugo/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In it, we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
lecidhugo/fast-bert
Super easy library for BERT-based NLP models
lecidhugo/hello-world
testing
lecidhugo/lecidhugo.github.io
lecid blog
lecidhugo/nbdev_template
lecidhugo/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
lecidhugo/Resemblyzer
A Python package to analyze and compare voices with deep learning