Helsinki-NLP/Tatoeba-Challenge

Indonesian missing?

jvamvas opened this issue · 2 comments

Thank you for providing this great resource!

It seems that Indonesian is not part of the Challenge right now. On OPUS, there are 11.8M sentence pairs for Indonesian–English alone. Was Indonesian left out on purpose?

Indonesian is part of the 'msa' (Malay) models, because we group languages by macrolanguage as specified in ISO 639-3.
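As a rough illustration of that grouping, here is a minimal sketch of mapping an individual ISO 639-3 code to its macrolanguage. The dictionary is a small hand-written subset of the ISO 639-3 macrolanguage table, and the names `MACROLANGUAGE_OF` and `model_language_code` are hypothetical; this is not the project's actual lookup code.

```python
# Illustrative subset of the ISO 639-3 macrolanguage mappings (assumption:
# hand-picked entries for this example, not the project's real table).
MACROLANGUAGE_OF = {
    "ind": "msa",  # Indonesian -> Malay (macrolanguage)
    "zsm": "msa",  # Standard Malay -> Malay (macrolanguage)
    "zlm": "msa",  # Malay (individual language) -> Malay (macrolanguage)
}


def model_language_code(code: str) -> str:
    """Return the macrolanguage code if one is listed, else the code itself."""
    return MACROLANGUAGE_OF.get(code, code)


print(model_language_code("ind"))  # msa
print(model_language_code("deu"))  # deu (no macrolanguage listed here)
```

So when looking for Indonesian data or models, searching for 'msa' rather than 'ind' is the thing to try.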

Thanks!