soobinseo/Transformer-TTS

Content of synthesis doesn't match input text


Hello

I trained Transformer TTS plus the postnet on the SIWIS French speech corpus. The synthesis quality is quite good, but the content of the speech doesn't match the input text. Also, the beginning of the speech signal in all generated test samples has similar phonetic content. I trained the transformer network for 471548 steps and the postnet network for 431742 steps. The training data amounted to 4 hours.

So is the issue in the fine-tuning of parameters, or should I train the network for more iterations?

Sample outputs are given below:

https://drive.google.com/file/d/1VpDWCj5eqrRcWufkoPlQJoTYVmLBZ4-u/view?usp=sharing

https://drive.google.com/file/d/1x3ERREDRpMPNFZS-EVJ7Og7o4pS37QzB/view?usp=sharing

Any help or suggestions would be appreciated.