Text-to-speech synthesis based on a controllable latent representation

Primary language: Python

TTS_VAE

A conditional generative model based on the variational autoencoder (VAE) that learns a disentangled latent representation, enabling controllable text-to-speech synthesis.
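
As a rough illustration of the VAE machinery this description refers to, the sketch below shows the reparameterized sampling step, the KL regularizer for a diagonal Gaussian posterior, and a latent traversal that varies one dimension while holding the others fixed. It is not code from this repository: the function names `reparameterize`, `kl_to_standard_normal`, and `traverse` are hypothetical, and the standard normal prior is an assumption made for simplicity (the referenced paper may use a different prior).

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps, with eps ~ N(0, I) and sigma = exp(0.5 * log_var)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(sigma^2)) || N(0, I)), summed over latent dimensions.

    Assumes a standard normal prior, which is a simplification.
    """
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))


def traverse(z, dim, values):
    """Return one copy of z per value, overriding only latent dimension `dim`."""
    return [z[:dim] + [v] + z[dim + 1:] for v in values]


if __name__ == "__main__":
    rng = random.Random(0)
    mu, log_var = [0.5, -1.0, 0.0], [0.0, -2.0, 0.0]  # illustrative encoder outputs
    z = reparameterize(mu, log_var, rng)
    print("KL term:", kl_to_standard_normal(mu, log_var))
    for z_variant in traverse(z, dim=1, values=[-2.0, 0.0, 2.0]):
        print(z_variant)
```

If the representation is disentangled, each list produced by `traverse` would be decoded together with the same input text, so that only the attribute tied to the chosen dimension changes in the synthesized speech.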

This project is an implementation in progress of the paper at https://arxiv.org/pdf/1810.07217.pdf