fastspeech2

There are 29 repositories under fastspeech2 topic.

open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4k 53 112337
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Language:Python3.7k 78 682800
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
Language:Python599 29 6183
ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
Language:Python449 8 39105
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Language:Python314 14 1941
Executedone/Chinese-FastSpeech2
基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏
Language:Python222 7 2135
rishikksh20/FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
Language:Jupyter Notebook217 10 1252
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
Language:C++207 15 920
rishikksh20/AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Language:Jupyter Notebook157 7 1140
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Language:Python141 11 519
tuanh123789/AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
Language:Python96 3 1727
ga642381/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
Language:Python91 8 617
rishikksh20/LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Language:Python79 9 57
xcmyz/FastSpeech2
The Implementation of FastSpeech2 Based on Pytorch.
Language:Python53 5 48
hwRG/End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
Language:Python24 3 18
Adibian/ResGrad
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Language:Python14
AppleHolic/FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
Language:Python13 2 22
deepaudio/deepaudio-tts
Language:Python12 4 02
dathudeptrai/FastSpeech2
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
11 3 10
alessandropec/data_driven_ai_voice_cloning
This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering
Language:Python8 3 11
hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker
Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.
Language:Python8 1 11
ssmlkl/MnTTS2
This is the experimental description of MnTTS2.
Language:Jupyter Notebook7 2 42
nikolaStanojkovski/Assistive_Bus_Helper
An Android application that allows visually impaired people to hear which bus lines are passing next to them.
Language:Python2 1 01
lordzuko/SpeakingStyle
Aligning latent space of speaking style with human perception using a re-embedding strategy
Language:Jupyter Notebook1 1 00
nikolaStanojkovski/Talk_Through_Me
An Android application that acts as a speaking assistant for the hearing impaired people.
Language:Python1
quackson/DG_HW
homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan
Language:Python1 1 00
utkarsh2299/Fastspeech2_HS
Created this repo as a part of the project "Speech Technologies in Indian languages". About Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrated with disability aids and various other applications.
Language:Perl0 1 00
gagan3012/image2audio
Convert Image to audio using ViT, GPT and FastSpeech
Language:Python3 0
lordzuko/FastSpeech2-jax
Implementation of FastSpeech2 in JAX
1 0

fastspeech2

open-mmlab/Amphion

TensorSpeech/TensorFlowTTS

PaddlePaddle/Parakeet

ranchlai/mandarin-tts

keonlee9420/Comprehensive-Transformer-TTS

Executedone/Chinese-FastSpeech2

rishikksh20/FastSpeech2

ZDisket/TensorVox

rishikksh20/AdaSpeech

keonlee9420/Comprehensive-E2E-TTS

tuanh123789/AdaSpeech

ga642381/FastSpeech2

rishikksh20/LightSpeech

xcmyz/FastSpeech2

hwRG/End-to-End-TTS-Fine-Tune

Adibian/ResGrad

AppleHolic/FastSpeech2

deepaudio/deepaudio-tts

dathudeptrai/FastSpeech2

alessandropec/data_driven_ai_voice_cloning

hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker

ssmlkl/MnTTS2

nikolaStanojkovski/Assistive_Bus_Helper

lordzuko/SpeakingStyle

nikolaStanojkovski/Talk_Through_Me

quackson/DG_HW

utkarsh2299/Fastspeech2_HS

gagan3012/image2audio

lordzuko/FastSpeech2-jax