LuisOtavioSantos

BRASIL

Pinned Repositories

cursoteoriadacomputacao
00
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python00
exemplo_recipes
udemy course django
Language:Python00
flask_ajax_jquery
The code for all the YouTube videos published on YouTube.
Language:JavaScript00
monk_v1
Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.
Language:Jupyter Notebook00
wav_split
PYTHON: Split a wav file with a start and end time.
Language:Python00
ZipWavExtract
Extracting Audio/Speech files in wav format from ZIP files.
Language:Python11

LuisOtavioSantos/ZipWavExtract
Extracting Audio/Speech files in wav format from ZIP files.
Language:Python11
LuisOtavioSantos/cursoteoriadacomputacao
00
LuisOtavioSantos/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python00
LuisOtavioSantos/exemplo_recipes
udemy course django
Language:Python00
LuisOtavioSantos/flask_ajax_jquery
The code for all the YouTube videos published on YouTube.
Language:JavaScript00
LuisOtavioSantos/monk_v1
Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.
Language:Jupyter Notebook00
LuisOtavioSantos/wav_split
PYTHON: Split a wav file with a start and end time.
Language:Python00