tibetan-nlp
There are 15 repositories under the tibetan-nlp topic.
OpenPecha/Botok
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
OpenPecha/pybo
🦜 NLP for Tibetan, in Python.
Esukhia/Corpora
Repository for Tibetan corpora
Esukhia/bophono
Tibetan phonetics engine in Python
luciusssss/mc2_corpus
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Esukhia/ud-pos-tagger-bo
Basic Universal Dependencies Part-of-Speech Tagger for Tibetan
Esukhia/bospell-old
An application of PyBo to Tibetan Spell-Checking
Esukhia/bo-pos
Resources connected to Tibetan part of speech
billingsmoore/MLotsawa
This app is a first step toward providing effective machine translation for the Classical Tibetan corpus of important religious, philosophical, and historical texts that were nearly lost during the invasion of Tibet.
Esukhia/bo-freq-diff
Syllable-based diffs that make use of Google's diff-match-patch and pybo's preprocess
Esukhia/canon-freq
Tibetan canon segmented and tagged for frequency analysis
Esukhia/punct_patterns
Discover punctuation patterns and usage in the Derge Kangyur