A repo of NLP tools for the Nepali language. I'm using it as a sandbox to learn and tinker with language models and other NLP concepts. I chose Nepali as the base language because a lot of people are already working on English NLP, and I'm hoping to contribute to the progress of Nepali NLP.
- Nepali News Crawler
- Word Tokenizer
- Sentence Tokenizer
- Stemmer
- Word Embeddings (word2vec, GloVe, FastText, Transformers)
- Language Model
- Nepali Corpus
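
To give a rough idea of what the sentence and word tokenizers in the list above deal with, here is a minimal sketch. It is not the code in this repo: the function names `sentence_tokenize` and `word_tokenize` are hypothetical, and it assumes that sentences end with a danda (`।`), `?`, or `!` followed by whitespace, and that a word is a run of Devanagari characters or ASCII letters and digits.

```python
import re

# Assumption: a sentence ends with a danda (।), "?" or "!" followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[।?!])\s+")

# Assumption: a word is a run of Devanagari characters (U+0900-U+097F, minus the
# danda U+0964 and double danda U+0965, plus the zero-width joiners used in
# conjuncts) or a run of ASCII letters and digits.
_WORD = re.compile(r"[\u0900-\u0963\u0966-\u097F\u200c\u200d]+|[A-Za-z0-9]+")


def sentence_tokenize(text):
    """Split text into sentences, keeping the terminating punctuation."""
    return [s.strip() for s in _SENTENCE_END.split(text.strip()) if s.strip()]


def word_tokenize(sentence):
    """Return the words in a sentence, dropping punctuation such as the danda."""
    return _WORD.findall(sentence)


if __name__ == "__main__":
    text = "म घर जान्छु। तिमी कहाँ जान्छौ?"
    for sentence in sentence_tokenize(text):
        print(sentence, word_tokenize(sentence))
```

A regex-only approach like this will mis-split abbreviations and text with no space after the danda, so treat it as a starting point rather than a finished tokenizer.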