speedy-speech-wn

Question

rejuce opened this issue 4 years ago · 3 comments

I would like to change the TTS model in the script from Tacotron to speedy-speech-wn. Any chance you can point me to where I need to change that? (I did not find the string directly in the script.)

The model is listed when I type tts --list_models, and I find it better sounding than Tacotron2.

Answer 1 · 2021-02-08T00:07:10.000Z

Not too sure, probably best to ask this on the main git: https://github.com/mozilla/TTS

Please do let me know if you figure out whether they support it, and I can have a look at whether my loop works for that too.

Answer 2 · 2021-02-08T07:00:48.000Z

Thank you for your reply. I think your idea of splitting the text into sentences, querying sentence by sentence, and merging at the end is great.

I am trying a different approach now: I forked your repository and am replacing the TTS part with an HTTP call to the TTS web server / Docker container. That way the text splitting and WAV merging become decoupled from the TTS. It is already working, but occasionally the last word of a sentence gets repeated indefinitely; I have to look into it more closely... (I am not proficient in Python, unfortunately...)
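
For anyone following along, here is a rough sketch of what that decoupled setup could look like: split the text into sentences, request one WAV per sentence over HTTP, then concatenate the results. It is not the code from the fork. The function names are made up for illustration, and the server address (http://localhost:5002) and the /api/tts?text=... endpoint are assumptions about the TTS demo server that should be checked against the running container. It also assumes the server returns plain PCM WAV data with the same sample rate and channel count for every sentence.

```python
import io
import re
import urllib.parse
import urllib.request
import wave


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def synthesize(sentence, base_url="http://localhost:5002"):
    """Fetch WAV bytes for one sentence from the TTS server.

    Assumption: the server answers GET /api/tts?text=... with a WAV body.
    """
    query = urllib.parse.urlencode({"text": sentence})
    url = f"{base_url}/api/tts?{query}"
    with urllib.request.urlopen(url, timeout=120) as resp:
        return resp.read()


def merge_wavs(chunks):
    """Concatenate WAV byte strings that share the same audio format."""
    out = io.BytesIO()
    writer = None
    for chunk in chunks:
        with wave.open(io.BytesIO(chunk), "rb") as reader:
            if writer is None:
                # Copy channels, sample width and rate from the first chunk.
                writer = wave.open(out, "wb")
                writer.setparams(reader.getparams())
            writer.writeframes(reader.readframes(reader.getnframes()))
    if writer is not None:
        writer.close()  # patches the header with the final frame count
    return out.getvalue()


def text_to_wav(text, synth=synthesize):
    """Synthesize text sentence by sentence and return one merged WAV."""
    return merge_wavs([synth(s) for s in split_sentences(text)])
```

With a server running, open("out.wav", "wb").write(text_to_wav("First sentence. Second sentence.")) would write the merged audio. Passing the synthesis function in as a parameter keeps the splitting and merging independent of the TTS backend, so it can be swapped or stubbed out.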

My ultimate goal is to convert whole ebooks with it. If that works, I might also build a small Qt frontend for it.

Answer 3 · 2021-02-10T11:41:32.000Z

That sounds very cool, please get in touch if you need any help with the Python part. In all honesty, when I built this I had just started with Python, so the script is far from elegant.