Language Statistics Data

The data in this repository was generated by the processing pipeline at https://github.com/liamks/languageStatistics.