Pinned Repositories
cursoteoriadacomputacao
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, both stationary and non-stationary, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve the model's performance and generalization.
exemplo_recipes
Udemy course on Django.
flask_ajax_jquery
The code for all the videos published on YouTube.
monk_v1
Monk is a low-code deep learning tool and a unified wrapper for computer vision.
wav_split
Python: split a WAV file between a start time and an end time.
ZipWavExtract
Extracts audio/speech files in WAV format from ZIP files.
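As an illustration of what ZipWavExtract's description refers to, here is a minimal sketch of pulling WAV files out of a ZIP archive using only Python's standard library. It is not taken from the repository: the function name `extract_wavs` and its parameters are hypothetical, and selecting files by a case-insensitive `.wav` suffix is an assumption.

```python
import zipfile
from pathlib import Path


def extract_wavs(zip_path, out_dir):
    """Extract every .wav member of zip_path into out_dir.

    Hypothetical helper for illustration; returns the paths that were written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            # Skip directories and anything without a .wav suffix (assumption:
            # the file extension is a good enough test for WAV content).
            if info.is_dir() or not info.filename.lower().endswith(".wav"):
                continue
            # extract() keeps the member's relative path and returns where it landed.
            written.append(Path(archive.extract(info, out_dir)))
    return written
```

Calling `extract_wavs("recordings.zip", "wavs")` (both names are placeholders) would leave the archive's folder layout intact under `wavs/` and ignore non-WAV members.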
LuisOtavioSantos's Repositories
LuisOtavioSantos/ZipWavExtract
Extracts audio/speech files in WAV format from ZIP files.
LuisOtavioSantos/cursoteoriadacomputacao
LuisOtavioSantos/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, both stationary and non-stationary, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve the model's performance and generalization.
LuisOtavioSantos/exemplo_recipes
Udemy course on Django.
LuisOtavioSantos/flask_ajax_jquery
The code for all the videos published on YouTube.
LuisOtavioSantos/monk_v1
Monk is a low-code deep learning tool and a unified wrapper for computer vision.
LuisOtavioSantos/wav_split
Python: split a WAV file between a start time and an end time.
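To make the wav_split description concrete, the following is a minimal sketch of cutting a WAV file between a start and an end time with the standard-library `wave` module. It is not the repository's code: the function name `split_wav`, its parameters, and the choice to express times in seconds are assumptions made for illustration.

```python
import wave


def split_wav(src_path, dst_path, start_s, end_s):
    """Copy the audio between start_s and end_s (seconds) into a new WAV file.

    Hypothetical helper for illustration; returns the number of frames written.
    """
    if start_s < 0 or end_s <= start_s:
        raise ValueError("expected 0 <= start_s < end_s")
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        rate = src.getframerate()
        total = src.getnframes()
        # Convert times to frame indices, clamped to the length of the file.
        start_frame = min(int(start_s * rate), total)
        end_frame = min(int(end_s * rate), total)
        src.setpos(start_frame)
        frames = src.readframes(end_frame - start_frame)
    with wave.open(dst_path, "wb") as dst:
        # Reuse channels, sample width and rate; the frame count in the
        # header is corrected automatically when the file is closed.
        dst.setparams(params)
        dst.writeframes(frames)
    return end_frame - start_frame
```

For example, `split_wav("speech.wav", "clip.wav", 1.5, 4.0)` (placeholder file names) would write the 2.5 seconds starting at 1.5 s to `clip.wav`. Working in frame indices rather than bytes keeps the cut aligned to whole samples regardless of channel count or sample width.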