smoke-trees/Voice-synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Python
Stargazers
- 15857541616
- adionditsakurteknik
- agarkraPwC
- aitalk
- andreabak
- avilashSelf - Employed
- bakszero@LinkedIn
- BeibeiOuyang
- bill4aus@KitSoftAU
- drock577Intel
- echoaimaomao
- flagman
- guilhermeeuzebioClosecare
- HIN0209
- JananiPN19Virtusa
- javaarchive@Prism-Client
- joshbybeeHallmark Cards, Inc.
- mapledxf
- Neverik
- progmars
- rajateku
- richieyoum@kohofinancial
- ruclionTsinghua University
- Severus11Bengaluru
- shivkanthb@mainframecomputer
- Surya2709Tirupur
- thundreeBrazil
- TiramisuJ
- Usama3627
- vincentweikey
- wahidmounir
- WhiteFu
- wulb2018
- YA-Cong
- Yannecc
- zhihanyang