Content of synthesized speech doesn't match input text
ajinkyakulkarni14 commented
Hello,
I trained Transformer TTS + postnet on the SIWIS French speech corpus. The quality of the synthesis is quite good, but the content of the speech doesn't match the input text. Also, the beginning of the speech signal in all generated test samples has similar phonetic content. I trained the transformer network for 471548 steps and the postnet network for 431742 steps. The training data amounted to 4 hours.
So is the issue in the fine-tuning of parameters, or should I train the network for more iterations?
Sample outputs are given below:
https://drive.google.com/file/d/1VpDWCj5eqrRcWufkoPlQJoTYVmLBZ4-u/view?usp=sharing
https://drive.google.com/file/d/1x3ERREDRpMPNFZS-EVJ7Og7o4pS37QzB/view?usp=sharing
Any help or suggestions would be appreciated.