ViTTS is a library for advanced Text-to-Speech generation with multilingual support, including Chinese, Japanese, and Vietnamese.
ViTTS builds on the latest research and is designed to achieve the best trade-off among ease of training, inference speed, and speech quality.
Please use our dedicated channels for questions and discussion. Help is much more valuable if it's shared publicly so that more people can benefit from it.
Type | Platforms |
---|---|
Bug Reports | GitHub Issue |
Feature Requests & Ideas | GitHub Issue |
Usage Questions | GitHub Discussions |
General Discussion | LinkedIn or Gitter Room |
Type | Links |
---|---|
Documentation | ReadTheDocs |
Installation | TTS/README.md |
Contributing | CONTRIBUTING.md |
Road Map | Main Development Plans |
- Tacotron: paper
- Tacotron2: paper
- Glow-TTS: paper
- Speedy-Speech: paper
- Align-TTS: paper
- FastPitch: paper
- FastSpeech: paper
- VITS: paper
- Guided Attention: paper
- Forward Backward Decoding: paper
- Graves Attention: paper
- Double Decoder Consistency: blog
- Dynamic Convolutional Attention: paper
- Alignment Network: paper
- MelGAN: paper
- MultiBandMelGAN: paper
- ParallelWaveGAN: paper
- GAN-TTS discriminators: paper
- WaveRNN: origin
- WaveGrad: paper
- HiFiGAN: paper
- UnivNet: paper
You can also help us implement more models.