fastspeech2

There are 29 repositories under fastspeech2 topic.

  • Amphion

    open-mmlab/Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

    Language:Python4k53112337
  • TensorSpeech/TensorFlowTTS

    :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

    Language:Python3.7k78682800
  • PaddlePaddle/Parakeet

    PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

    Language:Python599296183
  • ranchlai/mandarin-tts

    Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

    Language:Python449839105
  • keonlee9420/Comprehensive-Transformer-TTS

    A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

    Language:Python314141941
  • Executedone/Chinese-FastSpeech2

    基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏

    Language:Python22272135
  • rishikksh20/FastSpeech2

    PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

    Language:Jupyter Notebook217101252
  • ZDisket/TensorVox

    Desktop application for neural speech synthesis written in C++

    Language:C++20715920
  • rishikksh20/AdaSpeech

    AdaSpeech: Adaptive Text to Speech for Custom Voice

    Language:Jupyter Notebook15771140
  • keonlee9420/Comprehensive-E2E-TTS

    A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

    Language:Python14111519
  • tuanh123789/AdaSpeech

    An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

    Language:Python9631727
  • ga642381/FastSpeech2

    Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:

    Language:Python918617
  • rishikksh20/LightSpeech

    LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

    Language:Python79957
  • xcmyz/FastSpeech2

    The Implementation of FastSpeech2 Based on Pytorch.

    Language:Python53548
  • hwRG/End-to-End-TTS-Fine-Tune

    Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

    Language:Python24318
  • Adibian/ResGrad

    Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

    Language:Python14
  • AppleHolic/FastSpeech2

    Refactored version of https://github.com/ming024/FastSpeech2

    Language:Python13222
  • dathudeptrai/FastSpeech2

    A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

  • alessandropec/data_driven_ai_voice_cloning

    This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering

    Language:Python8311
  • hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker

    Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.

    Language:Python8111
  • ssmlkl/MnTTS2

    This is the experimental description of MnTTS2.

    Language:Jupyter Notebook7242
  • nikolaStanojkovski/Assistive_Bus_Helper

    An Android application that allows visually impaired people to hear which bus lines are passing next to them.

    Language:Python2101
  • lordzuko/SpeakingStyle

    Aligning latent space of speaking style with human perception using a re-embedding strategy

    Language:Jupyter Notebook1100
  • nikolaStanojkovski/Talk_Through_Me

    An Android application that acts as a speaking assistant for the hearing impaired people.

    Language:Python1
  • quackson/DG_HW

    homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan

    Language:Python1100
  • utkarsh2299/Fastspeech2_HS

    Created this repo as a part of the project "Speech Technologies in Indian languages". About Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrated with disability aids and various other applications.

    Language:Perl0100
  • gagan3012/image2audio

    Convert Image to audio using ViT, GPT and FastSpeech

    Language:Python30
  • lordzuko/FastSpeech2-jax

    Implementation of FastSpeech2 in JAX