Pinned Repositories
Attentions-in-Tacotron
BunchedLPCnet
This repository provides UNOFFICIAL Bunched LPCNet implementations in PyTorch.
ddsp
DDSP: Differentiable Digital Signal Processing
DL-Art-School
DLAS - A configuration-driven trainer for generative models
efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building the prosody encoder.
hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Tacotron2
BridgetteSong's Repositories
BridgetteSong/ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building the prosody encoder.
BridgetteSong/BunchedLPCnet
This repository provides UNOFFICIAL Bunched LPCNet implementations in PyTorch.
BridgetteSong/Tacotron2
BridgetteSong/ddsp
DDSP: Differentiable Digital Signal Processing
BridgetteSong/Attentions-in-Tacotron
BridgetteSong/DL-Art-School
DLAS - A configuration-driven trainer for generative models
BridgetteSong/efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
BridgetteSong/hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
BridgetteSong/LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
BridgetteSong/multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
BridgetteSong/Parallel-Tacotron2
PyTorch implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
BridgetteSong/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
BridgetteSong/Robust_Fine_Grained_Prosody_Control
PyTorch implementation of "Robust and Fine-Grained Prosody Control of End-to-End Speech Synthesis" (unofficial)
BridgetteSong/SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
BridgetteSong/STYLER
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech
BridgetteSong/StyleSpeech
PyTorch implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation
BridgetteSong/TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
BridgetteSong/TTS_TFLite
This repository is a collection of TTS models in TFLite.
BridgetteSong/UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
BridgetteSong/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
BridgetteSong/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021