neural-tts

There are 20 repositories under the neural-tts topic.

keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Language: Python 328 20 2936
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.
Language: Python 318 12 2041
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python 310 9 2744
KevinMIN95/StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Language: Python 238 6 2139
keonlee9420/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Language: Python 229 4 830
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation
Language: Python 190 6 1723
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language: Python 188 13 1944
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language: Python 181 6 1726
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.
Language: Python 143 11 519
keonlee9420/VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Language: Python 72 4 314
keonlee9420/FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Language: Python 71 2 314
keonlee9420/WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Language: Python 66 6 516
mush42/sonata
A cross-platform inference engine for neural TTS models.
Language: Rust 63 11 712
keonlee9420/Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Language: Python 56 3 713
keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.
Language: Python 45 4 1015
keonlee9420/Deep-Learning-TTS-Template
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
Language: Python 15 1 0
Mobile-Artificial-Intelligence/babylon.cpp
Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization, an ONNX Runtime port of the DeepPhonemizer model is used. For speech synthesis, VITS models are used. Piper models are compatible after a conversion script is run.
Language: Python 10 3 01
yokawasa/vscode-translator-voice
VS Code extension for multi-language text translation and TTS (text-to-speech) using Azure Cognitive Services. Please [✩Star] if you're using it!
Language: TypeScript 7 4 34
QuantiusBenignus/voluble
Let your GNOME desktop speak to you. Reads your desktop notifications out loud in a human-like voice using Piper.
Language: JavaScript 2 1 00
marcel2215/native-speaker
A simple Discord bot that synthesizes speech directly to a voice channel via text commands with support for sound effects.
Language: Python 0 1 00
