/xhosa-nlp

🔤 Extracting most frequent words in the Xhosa language using NLP

Primary LanguagePython

Xhosa NLP 🔤

GitHub license PRs Welcome

Quickstart

  1. Install NLTK
  2. Run python most_frequent_words.py
  3. Open results.csv to view results

Source of Corpus

Leipzig Corpora Collection

CITE: D. Goldhahn, T. Eckart & U. Quasthoff: Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages. In: Proceedings of the 8th International Language Ressources and Evaluation (LREC'12), 2012