Consider per-language module loading

Question

Consider per-language module loading

Closed this issue a year ago · 5 comments

NeonDaniel commented 3 years ago

Objective

Handle loading STT/TTS modules dynamically per-language

Implementation Goals

Expose methods for STT/TTS/Translation engines to advertise their supported languages
Handle optionally loading plugins per-language
Handle STT/TTS/Translation requests with the appropriate plugin

Open Issues

Do we handle language as en, en-US, either?
How many plugins can we efficiently load simultaneously?

Answer 1 · 2021-12-21T20:01:31.000Z

A proposed method for plugins reporting supported languages:

In each relevant plugin's base class, define a new property; if the list is empty, default behavior should assume that any requested language is supported for backwards-compat (probably logging a warning). If the list isn't empty, language should be validated at runtime and raise a custom exception (UnsupportedLanguage(ValueError)?) for the caller to handle.

@property
def supported_languages(self) -> list:
    return []

Answer 2 · 2021-12-21T20:21:58.000Z

in chatterbox before OPM we did this for TTS https://github.com/HelloChatterbox/text2speech/blob/master/text2speech/modules/__init__.py#L110

Answer 3 · 2022-01-08T01:09:37.000Z

in chatterbox before OPM we did this for TTS https://github.com/HelloChatterbox/text2speech/blob/master/text2speech/modules/__init__.py#L110

Makes sense for TTS to also have a method to get voice names since many engines do support multiple voices per language. I'm not sure that all plugins have or use voice names though? I guess it would just equate to an empty list for a supported language, should TTS just implement describe_voices that returns a dict and STT a similar method returning a list?

Opened an issue for this in mycroft-core. MycroftAI/mycroft-core#3059

Answer 4 · 2022-08-02T14:03:13.000Z

somewhat related PR OpenVoiceOS/ovos-plugin-manager#71

Answer 5 · 2023-10-25T17:09:02.000Z

Not highly relevant to Neon anymore; current strategy is to train Nemo STT/Coqui TTS models as needed