fastspeech2
There are 29 repositories under fastspeech2 topic.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Executedone/Chinese-FastSpeech2
基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏
rishikksh20/FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
rishikksh20/AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
tuanh123789/AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
ga642381/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
rishikksh20/LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
xcmyz/FastSpeech2
The Implementation of FastSpeech2 Based on Pytorch.
hwRG/End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
Adibian/ResGrad
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
AppleHolic/FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
dathudeptrai/FastSpeech2
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
alessandropec/data_driven_ai_voice_cloning
This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering
hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker
Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.
ssmlkl/MnTTS2
This is the experimental description of MnTTS2.
nikolaStanojkovski/Assistive_Bus_Helper
An Android application that allows visually impaired people to hear which bus lines are passing next to them.
lordzuko/SpeakingStyle
Aligning latent space of speaking style with human perception using a re-embedding strategy
nikolaStanojkovski/Talk_Through_Me
An Android application that acts as a speaking assistant for the hearing impaired people.
quackson/DG_HW
homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan
utkarsh2299/Fastspeech2_HS
Created this repo as a part of the project "Speech Technologies in Indian languages". About Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrated with disability aids and various other applications.
gagan3012/image2audio
Convert Image to audio using ViT, GPT and FastSpeech
lordzuko/FastSpeech2-jax
Implementation of FastSpeech2 in JAX