[INPUT] Text (or Speech) Length of Blaser 2.0

Question

[INPUT] Text (or Speech) Length of Blaser 2.0

foreveronehundred opened this issue 9 months ago · 1 comments

foreveronehundred commented 9 months ago

For translation quality estimation of Blaser 2.0, I think there is no limitation of the text (or the speech) length. However, from my personal perspective, I do not think the estimation will be accurate if the text (or the speech) is too long.

So, what text length and speech length (of source, reference, and hypothesis) do you recommend?

Answer 1 · 2024-04-16T16:22:07.000Z

SONAR was trained as a sentence encoder, so it expects the source, reference, and hypothesis to be single sentences.

Speech (or transcribed text without punctuation) is not always easy to segment into sentences, you can think of 30 seconds maximum as a rule of thumb.

Also, please check out the discussions of input lengths in the Seamless Communication repo (https://github.com/search?q=repo%3Afacebookresearch%2Fseamless_communication+length&type=issues), because Seamless and SONAR models were trained on similar tasks and similar data.