Native language names linked against their ISO 639-1 and 639-3 codes in machine-readable format

ISO 639 Autonyms

This repository was born out of a need for a single database of languages mapped to their ISO 639-1 and ISO 639-3 codes, with each language's native name ("autonym") listed against its tags.

While the data was initially generated from standard sources, contributions are welcome to correct overly anglicised names (those not using the native character set of the language) and to add name variants that have not been included.

Interpretation

The tag3 and tag1 columns hold the ISO 639-3 and ISO 639-1 tags respectively. The name field is the "most recognisable" form of the language name, typically in English, to be used as a fallback where an autonym is not available.

The autonym field is the name of the language in that language. If this field is blank, it means that there is no confirmed autonym for this language in this database and you may use the name field as a fallback.

A blank autonym does not necessarily indicate that the name field represents the autonym. Where the autonym and the name are the same, the value should be listed in both columns.
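The fallback rule above can be sketched in Python. This is a minimal illustration, not part of the repository: it assumes the data is tab-separated with a header row naming the tag3, tag1, name and autonym columns, and the sample rows are made up for the example (qaa is a code reserved for local use, not a real entry). The helper names display_name and load_by_tag are hypothetical.

```python
import csv
import io

# Illustrative rows only. The delimiter and column order are assumptions;
# check them against the actual data file before reusing this.
SAMPLE = (
    "tag3\ttag1\tname\tautonym\n"
    "swe\tsv\tSwedish\tsvenska\n"
    "eng\ten\tEnglish\tEnglish\n"
    "qaa\t\tPlaceholder Language\t\n"
)


def display_name(row):
    """Return the autonym if one is recorded, otherwise fall back to name."""
    autonym = (row.get("autonym") or "").strip()
    return autonym if autonym else row["name"]


def load_by_tag(text):
    """Index rows by ISO 639-3 tag, and by ISO 639-1 tag where one exists."""
    index = {}
    for row in csv.DictReader(io.StringIO(text), delimiter="\t"):
        index[row["tag3"]] = row
        if row["tag1"]:  # many languages have no two-letter tag
            index[row["tag1"]] = row
    return index


languages = load_by_tag(SAMPLE)
print(display_name(languages["sv"]))   # autonym is present
print(display_name(languages["qaa"]))  # blank autonym, falls back to name
```

Note that the English row lists the same value in both columns, so a consumer never has to guess whether a blank autonym means "same as name" or "unknown".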

Sources

This repository currently uses the following sources:

License

Databases in many countries do not attract intellectual property rights, and where they do, they very rarely attract copyright due to the raw and inexpressive nature of the data. However, to avoid doubt, this data is being published by a resident of Sweden, where sui generis database rights do not apply to non-EU datasets. CLDR and Ethnologue are both datasets published in the US, where database rights also do not apply.

However, for those who have annoying and immovable legal teams and want to use this in a product, the dataset is licensed under the Creative Commons Zero 1.0 license.

Contributing

Any contributions to this repository are on the condition that the contributor relinquishes all database rights, copyrights and all other intellectual property rights or claims to the contribution.

Mark the source as github in the data.