SSML possible?

Question

Closed this issue 4 years ago · 2 comments

ozett commented 5 years ago

Can I use SSML tags to control the TTS?
I don't see anything mentioned here...

https://cloud.google.com/text-to-speech/docs/ssml

thx,

Answer 1 · 2020-11-04T08:33:44.000Z

Hi @ozett

I agree with @Hypnos3 here: Hypnos3/node-red-contrib-cast#75 (comment)

zlargon/google-tts uses Google Translate TTS, that is, the synthetic voice currently used by the Google Translate web service. This voice is NOT controllable (apart from a minor and, IMO, useless speed setting).
Google Text-to-Speech is the pay-per-usage Google Cloud Platform service. That's another story.

That said, a small subset of SSML features, such as controlling pauses, could be implemented on top of zlargon/google-tts.
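
As a rough illustration of what "pauses on top of the package" could look like, here is a minimal sketch that splits a string containing SSML `<break time="..."/>` tags into speak and pause steps. The function name `parseBreaks` and the step format are hypothetical and not part of zlargon/google-tts; each speak step would then be synthesized separately, with silence inserted between them by the player.

```javascript
// Hypothetical helper, not part of zlargon/google-tts.
// Splits text into "speak" and "pause" steps, recognizing only
// self-closing <break time="Nms"/> or <break time="Ns"/> tags.
function parseBreaks(input) {
  const breakTag = /<break\s+time="(\d+(?:\.\d+)?)(ms|s)"\s*\/>/g;
  const steps = [];
  let cursor = 0;
  let match;
  while ((match = breakTag.exec(input)) !== null) {
    // Text between the previous tag (or the start) and this tag.
    const text = input.slice(cursor, match.index).trim();
    if (text) steps.push({ type: "speak", text });
    // Normalize the pause duration to milliseconds.
    const value = parseFloat(match[1]);
    steps.push({ type: "pause", ms: match[2] === "s" ? value * 1000 : value });
    cursor = breakTag.lastIndex;
  }
  // Remaining text after the last tag.
  const rest = input.slice(cursor).trim();
  if (rest) steps.push({ type: "speak", text: rest });
  return steps;
}

console.log(parseBreaks('Hello <break time="500ms"/> world'));
```

Any other SSML element is left in the text untouched by this sketch, which is one reason a full SSML layer would be much more work than pause handling alone.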

BTW, I'm working on https://github.com/solyarisoftware/jointts, a concatenative TTS experiment (which can optionally use this package). For example, you can build speech by configuring a BLANK to correspond to some number of milliseconds, a FULL_STOP to a larger number of milliseconds, etc.
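
To make the idea concrete, here is a small sketch of mapping a BLANK and a FULL_STOP to pause durations. It is not taken from jointts: the `PAUSE_MS` table, its values, and the `planSpeech` function are assumptions made up for illustration.

```javascript
// Illustrative pause table; names follow the comment above,
// values are assumptions and not jointts defaults.
const PAUSE_MS = { BLANK: 100, FULL_STOP: 600 };

// Hypothetical helper: turn a sentence into a sequence of word
// chunks and pauses (in milliseconds).
function planSpeech(sentence, pauses = PAUSE_MS) {
  const steps = [];
  // Tokens are either runs of non-blank, non-period characters or a period.
  const tokens = sentence.match(/[^\s.]+|\./g) || [];
  for (const token of tokens) {
    if (token === ".") {
      steps.push({ pause: pauses.FULL_STOP });
    } else {
      // A blank between two consecutive words becomes a short pause.
      const prev = steps[steps.length - 1];
      if (prev && prev.word !== undefined) steps.push({ pause: pauses.BLANK });
      steps.push({ word: token });
    }
  }
  return steps;
}

console.log(planSpeech("Hi there. Bye."));
```

A player would then synthesize or look up each word chunk and insert the corresponding silence between chunks.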

Nevertheless, I fear it's not worth implementing SSML for such a restricted set of features.
My two cents.

Answer 2 · 2020-11-07T08:20:56.000Z

Thanks for the link to your GitHub repo, I will have a look.