laurieburchell/open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
PerlGPL-3.0
Stargazers
- AbrahamLopez10Botpress.com
- alvations
- andjcMelbourne, Australia
- bryandeng@Tencent
- ccoreilly@parloa
- chris-ha458Independent research with EleutherAI, DuckAI
- cnlinxiShanghai
- egorsmkv50.4501° N, 30.5234° E
- Ekaterina-Sinkova
- fcheslack
- fly51flyPRIS
- jorirsanUniversitat Politècnica de València
- kandation
- kargaranamir@cisnlp
- karrynestHuawei
- kleczekr
- laurieburchellUniversity of Edinburgh
- LBeaudouxFrance
- MatthieuFP
- nampdnSomewhere on Earth
- NatureLProokie
- picografixPicografix
- pjox@commoncrawl
- raymondng76AI Singapore
- santhoshtr@wikimedia @smc
- sergeyfData Cowboys
- sunnnnnnnny
- tomhoskingEdinburgh
- ucyang
- Uineljpoolside
- wannaphong@PyThaiNLP
- weikang-wang
- WikidepiaIndonesia
- yikang0131
- ZHAOTINGSakana AI
- ZJaumePrompsit Language Engineering