/northeuralex

CLDF dataset derived from Dellert et al.'s "NorthEuraLex" from 2020

Primary LanguageTeXCreative Commons Attribution 4.0 InternationalCC-BY-4.0

CLDF dataset derived from Dellert et al.'s "NorthEuraLex (Version 0.9)" from 2020

CLDF validation

How to cite

If you use these data please cite

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at http://www.northeuralex.org

Conceptlists in Concepticon:

Notes

This large database covers several languages of Northern Eurasia. For the conversion to CLDF, we considerably adjusted the IPA in the source.

Statistics

CLDF validation Glottolog: 100% Concepticon: 94% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 107 (linked to 107 different Glottocodes)
  • Concepts: 1,016 (linked to 954 different Concepticon concept sets)
  • Lexemes: 121,611
  • Sources: 1
  • Synonymy: 1.15
  • Invalid lexemes: 0
  • Tokens: 699,892
  • Segments: 678 (0 BIPA errors, 0 CLTS sound class errors, 676 CLTS modified)
  • Inventory size (avg): 52.43

Contributors

Name GitHub user Description Role
Tiago Tresoldi @tresoldi patron Other
Julius Steuer @justeuer orthographic profile Other
Johann-Mattis List @LinguList code, integration Editor
Robert Forkel @xrotwang code, integration Editor
Johannes Dellert editor DataCurator, DataManager, Author
Pavel Sofroniev @pavelsof original team cdlf curation DataCurator, DataManager

CLDF Datasets

The following CLDF datasets are available in cldf: