NVIDIA/mellotron

Voice synthesis by model is not the same as the voice with speaker ID

tuanh123789 opened this issue · 1 comments

Voice synthesis by model is not the same as the voice with speaker ID

I trained model on my Vietnamese dataset ( 46 speakers ). But when inference my output voice is not the same as the voice with speaker ID. Can you explain more detail to solve this problem. Thank you!