donghaiyw

Pinned Repositories

armspeech
Autoregressive probabilistic modelling for speech synthesis.
Language:Python0 1 00
auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python0 1 00
covarep
A Cooperative Voice Analysis Repository for Speech Technologies
Language:MATLAB0 1 00
CTC-Connectionist-Temporal-Classification
Theano implementation of CTC.
Language:Python0 1 00
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Language:Python0 1 00
deepvoice3
Tensorflow Implementation of Deep Voice 3
Language:Python0 1 00
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python0 1 00
DNN-SpeechEnhancement
Language:Python0 1 00
espnet
End-to-End Speech Processing Toolkit
Language:Shell0 1 00
FastSpeech
The Implementation of FastSpeech based on pytorch.
Language:Python0 1 00

donghaiyw's Repositories

donghaiyw/auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python0 1 00
donghaiyw/covarep
A Cooperative Voice Analysis Repository for Speech Technologies
Language:MATLAB0 1 00
donghaiyw/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python0 1 00
donghaiyw/espnet
End-to-End Speech Processing Toolkit
Language:Shell0 1 00
donghaiyw/FastSpeech
The Implementation of FastSpeech based on pytorch.
Language:Python0 1 00
donghaiyw/hifi-gan
donghaiyw/legacy_STRAIGHT
A vocoder framework which had been widely used in research community since 1999.
Language:Matlab1 0
donghaiyw/LPCTorch
LPC Utility for Pytorch Library.
Language:Python1 0
donghaiyw/magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Language:Python1 0
donghaiyw/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Language:Python1 0
donghaiyw/nnAudio
Audio processing by using pytorch 1D convolution network
Language:Jupyter Notebook1 0
donghaiyw/paper-reading
深度学习经典、新论文逐段精读
1 0
donghaiyw/ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with Pytorch
Language:Python1 0
donghaiyw/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
Language:Python1 0
donghaiyw/pytorch-handbook
pytorch handbook是一本开源的书籍，目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门，其中包含的Pytorch教程全部通过测试保证可以成功运行
Language:Jupyter Notebook1 0
donghaiyw/pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
Language:Python1 0
donghaiyw/Resemblyzer
A python package to analyze and compare voices with deep learning
Language:Python1 0
donghaiyw/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
Language:Python1 0
donghaiyw/sonnet
TensorFlow-based neural network library
Language:Python1 0
donghaiyw/speech-denoiser
A speech denoise lv2 plugin based on RNNoise library
Language:C1 0
donghaiyw/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
Language:Python1 0
donghaiyw/speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
Language:Python
donghaiyw/SqueezeWave
Language:Python1 0
donghaiyw/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language:Python1 0
donghaiyw/UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
Language:Python1 0
donghaiyw/waveglow
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
Language:Python1 0
donghaiyw/wavegrad
A fast, high-quality neural vocoder.
Language:Python1 0
donghaiyw/WaveRNN
Pytorch implementation of Deepmind's WaveRNN model
Language:Jupyter Notebook1 0
donghaiyw/WaveRNN-1
A WaveRNN implementation
Language:Python1 0
donghaiyw/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
Language:Python1 0