soobinseo/Transformer-TTS

Why do you think the model has not converged at 160K?

Do you have any basis for that?

I'm not sure, but I think the model has not learned the attention well enough.
Across many experiments, whether the attention is diagonal has been the most important indicator separating success from failure when generating samples.