DanRuta/xVA-Synth

How about translation other languages?

psycalc opened this issue · 3 comments

Guide plі in which (direction) neuron network (neural framework) you use and why? in wich direction should I look, in order too make voices more realistic, and sound in other languages? Is it possible at all or it is very complex and hard to train network?

v3 now supports multiple languages. A voice trained in English can somewhat also speak another language. Though more monotonally.

That article is super old. The v3 model now uses a slightly custom tweaked VITS/YourTTS model. Tweaks including larger capacity, bigger lang embedding, custom symbol set (a custom spec of ARPAbet with some more phonemes to cover other languages), and I guess a different training script.