machinetranslate/machinetranslate.org

List closely related languages


There are language groups where the name of the group is the more common search term, so the group is a good fit for its own article: #475

But there are also languages that are just very closely related, where one may also be the macrolanguage, and really we just need them to point to each other:

  • Filipino, Tagalog
  • Serbian, Bosnian, Croatian, Montenegrin
  • Hindi, Urdu
  • Persian, Tajik
  • Arabic, Algerian Arabic, Levantine Arabic...

In many of these cases it's effectively one language, and the difference is really the script (which may point to some religious difference or some occupier-imposed difference), which matters for machine translation.

We probably shouldn't get into specifying the type of relationship, but rather, just listing closely related languages, so people can easily find what they're looking for.

Same for:

  • Berber, Kabyle, Tamasheq