tacotron2
There are 44 repositories under tacotron2 topic.
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
BenAAndrew/Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices
NaruseMioShirakana/DragonianVoice
多个SVC/TTS的C++推理库
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
hash2430/pitchtron
TTS for pitch-accented language. Korean dialect DB.
BogiHsu/Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
atomicoo/tacotron2-mandarin
Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on Tacotron-2 model.
ide8/tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
nipponjo/tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
jieran233/CyberWaifu
GPT + Tacotron2/VITS + Live2D = CyberWaifu
sovaai/sova-tts-engine
Tacotron2 based engine for the SOVA-TTS project
joannahong/Lip2Wav-pytorch
a PyTorch implementation of Lip2Wav
keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
liuhaozhe6788/voice-cloning-collab
an improved version of Real-time-voice-cloning
CookiePPP/cookietts
[Last Updated 2021] TTS from Cookie. Messy and experimental!
lokkelvin2/tacotron2-tts-GUI
Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom Twitch TTS.
thuhcsi/tacotron
PyTorch implementation of Tacotron and Tacotron2
monatis/german-tts
German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support
sooftware/tacotron2
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
atomicoo/Tacotron2-PyTorch
PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。
Mildemelwe/Non-English-Tacotron-2-Training-Notebook
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
parvatijay2901/Hindi-ASR-and-TTS
EC499: Major Project
alessandropec/data_driven_ai_voice_cloning
This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering
Adibian/persian_tacotron
Training Tacotron2 for Persian language as a Persian text-to-speech
wladradchenko/advanced.wunjo.wladradchenko.ru
Extension to add advanced features to Wunjo AI
AppleHolic/tacotron2-pytorch
Pytorch implementation of Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)
MahtaFetrat/Persian-MultiSpeaker-Tacotron2
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
mehdihosseinimoghadam/Catalan-Text-to-Speech
Catalan Text to Speech
eros71-dev/mario-voice-dataset
A dataset for Mario's voice (Charles Martinet), from the Super Mario franchise. More info here: https://uberduck.ai/about
spyroot/dtc
Dueling Turing Classifier
threhe13/Voice-replacement-program
Web Page that can replace audio of Education Video
XxNessuxX/Proyecto-de-clonacion-TTS-del-Neng-de-Castefa
Este proyecto explora la síntesis de voz para replicar el distintivo estilo vocal de Neng de Castefa. Analiza desafíos técnicos, tecnologías y presenta experimentos y resultados en la recreación de su voz única.
RALYHDB/ASV-spoofing
This repository contains the code and resources associated with my Bachelor's Thesis. The project evaluates the performance of various automatic speaker verification (ASV) systems against identity spoofing attacks generated using text-to-speech (TTS) synthesis technologies.