Mixtecan Cognate Database (MixteCoDB)

This database contains lexical entries from Mixtecan languages, which are cognate-coded and standardized to IPA. It is a work in progress and is continuously updated. For use in your own research and for citation, please refer to the most recent release archived on Zenodo. The database is available under the Creative Commons Attribution-ShareAlike 4.0 International license. Questions, comments, and corrections are most welcome! Please open an issue to share them.

MixteCoDB 1.0

The initial creation of the database, which corresponds to its first release (1.0), is explained in:

Auderset, Sandra, Simon J. Greenhill, Christian T. DiCanio, Eric W. Campbell. 2023. Subgrouping in a 'dialect continuum': A Bayesian phylogenetic analysis of the Mixtecan language family. Journal of Language Evolution. 8(1). https://doi.org/10.1093/jole/lzad004
Auderset, Sandra, Simon J. Greenhill, Christian T. DiCanio, Eric W. Campbell. 2023. Supplementary Materials to "Subgrouping in a 'dialect continuum': A Bayesian phylogenetic analysis of the Mixtecan language family." Available on Zenodo at

MixteCoDB 1.1 - October 2023

Files and content

coding details

File explaining the conversion from orthography to IPA, along with other details of the standardization.

metadata

Metadata of the language sample:

DOCULECT = unique identifier for each Mixtec variety (containing only ASCII characters)
VillageName = name of the village where the variety is spoken
Abbreviation / MapAbbr = abbreviations of the varieties used for more legible plotting
AudersetGroup / AudersetGroupSub / AudersetGroupLow = (sub-)classification according to Auderset et al. 2023
JosserandArea / JosserandAreaSub = dialect area classification according to Josserand 1983
Latitude / Longitude = coordinates of the village
Glottocode = Glottolog identifier of the variety (if applicable)
ISO639P3code = ISO 639-3 code of the variety (if applicable)
JosserandCode = code used in Josserand 1983 (if applicable)
SOURCE = cite key of the source(s) of the data
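
The column layout above can be read with a few lines of Python. The following is a minimal sketch, not part of the database itself: it assumes the metadata is stored as comma-separated text with a header row, which this README does not state. The sample rows, the function names `load_metadata` and `index_by_doculect`, and all values are invented placeholders for illustration, and only a subset of the columns is shown.

```python
import csv
import io

# Hypothetical sample in the column layout described above. The values are
# invented placeholders, not real rows from the database, and the comma
# delimiter is an assumption.
SAMPLE = """DOCULECT,VillageName,Abbreviation,Latitude,Longitude,Glottocode,ISO639P3code,SOURCE
ExampleA,Example Village A,EXA,17.0,-97.5,,,examplekey2000
ExampleB,Example Village B,EXB,,,,,examplekey2001
"""


def load_metadata(handle):
    """Read metadata rows; empty cells become None, coordinates become floats."""
    rows = []
    for row in csv.DictReader(handle):
        # Columns marked "if applicable" may be empty, so map "" to None.
        record = {key: ((value or "").strip() or None) for key, value in row.items()}
        for coord in ("Latitude", "Longitude"):
            if record.get(coord) is not None:
                record[coord] = float(record[coord])
        rows.append(record)
    return rows


def index_by_doculect(rows):
    """Index rows by DOCULECT, checking that identifiers are unique and ASCII-only."""
    index = {}
    for record in rows:
        key = record["DOCULECT"]
        if key is None or not key.isascii():
            raise ValueError(f"DOCULECT must be a non-empty ASCII string: {key!r}")
        if key in index:
            raise ValueError(f"duplicate DOCULECT: {key}")
        index[key] = record
    return index


metadata = index_by_doculect(load_metadata(io.StringIO(SAMPLE)))
print(metadata["ExampleA"]["VillageName"], metadata["ExampleA"]["Latitude"])
```

To use this with the actual metadata file, replace `io.StringIO(SAMPLE)` with an opened file handle and adjust the delimiter if the file is not comma-separated.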