jhkonan

Pinned Repositories

opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
Language: C++
602 19 6877
deeplearning.cs.cmu.edu
11-785 Website
0 1 00
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language: Python
0 0 00
jhkonan.github.io
Fall 2019 Website for 11-785, Introduction to Deep Learning
Language: JavaScript
2 2 01
opensmile-python
Python package for openSMILE
Language: PHP
0 0 00
personal-website
Personal website for Joseph Konan
0 0 00
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language: C++
0 0 00
Spring2019_Tutorials
Language: Jupyter Notebook
0 0 00
VoIP-DNS-Challenge
0 1 00
FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Language: Python
247 5 2855

jhkonan's Repositories

jhkonan/jhkonan.github.io
Fall 2019 Website for 11-785, Introduction to Deep Learning
Language: JavaScript
2 2 01
jhkonan/deeplearning.cs.cmu.edu
11-785 Website
0 1 00
jhkonan/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language: Python
0 0 00
jhkonan/opensmile-python
Python package for openSMILE
Language: PHP
0 0 00
jhkonan/personal-website
Personal website for Joseph Konan
0 0 00
jhkonan/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language: C++
0 0 00
jhkonan/Spring2019_Tutorials
Language: Jupyter Notebook
0 0 00
jhkonan/VoIP-DNS-Challenge
0 1 00