Pinned Repositories
armspeech
Autoregressive probabilistic modelling for speech synthesis.
auraloss
Collection of audio-focused loss functions in PyTorch
covarep
A Cooperative Voice Analysis Repository for Speech Technologies
CTC-Connectionist-Temporal-Classification
Theano implementation of CTC.
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
deepvoice3
Tensorflow Implementation of Deep Voice 3
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
DNN-SpeechEnhancement
espnet
End-to-End Speech Processing Toolkit
FastSpeech
An implementation of FastSpeech based on PyTorch.
donghaiyw's Repositories
donghaiyw/auraloss
Collection of audio-focused loss functions in PyTorch
donghaiyw/covarep
A Cooperative Voice Analysis Repository for Speech Technologies
donghaiyw/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
donghaiyw/espnet
End-to-End Speech Processing Toolkit
donghaiyw/FastSpeech
An implementation of FastSpeech based on PyTorch.
donghaiyw/hifi-gan
donghaiyw/legacy_STRAIGHT
A vocoder framework that has been widely used in the research community since 1999.
donghaiyw/LPCTorch
LPC Utility for Pytorch Library.
donghaiyw/magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
donghaiyw/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
donghaiyw/nnAudio
Audio processing using PyTorch 1D convolutional networks
donghaiyw/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
donghaiyw/ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with Pytorch
donghaiyw/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
donghaiyw/pytorch-handbook
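pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to ensure they run successfully.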
donghaiyw/pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
donghaiyw/Resemblyzer
A python package to analyze and compare voices with deep learning
donghaiyw/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
donghaiyw/sonnet
TensorFlow-based neural network library
donghaiyw/speech-denoiser
A speech denoising LV2 plugin based on the RNNoise library
donghaiyw/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
donghaiyw/speech_feature_extractor
Some useful features for speech processing, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplitude Modulation Spectrum (AMS), and so on.
donghaiyw/SqueezeWave
donghaiyw/TTS
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
donghaiyw/UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
donghaiyw/waveglow
A PyTorch implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis
donghaiyw/wavegrad
A fast, high-quality neural vocoder.
donghaiyw/WaveRNN
Pytorch implementation of Deepmind's WaveRNN model
donghaiyw/WaveRNN-1
A WaveRNN implementation
donghaiyw/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)