Pinned Repositories
awesome-bangla
A collection of tools, datasets and resources on Bangla computing
bangla-academy-sort
A library of functions in several programming languages for sorting text according to the standard sort order defined by the Bangla Academy (বাংলা একাডেমী).
bengali-stemmer
A library of implementations of published stemming methods for the Bengali language.
bondhon
Bondhon, Bengali for "Connection", is a Python module for converting between popular modern and legacy Bengali character encodings.
corpora-preparation
corpus-builder
Toolkit for compiling corpora from various sources
lemmatizer
A rule-based lemmatizer for Bengali (Bangla), written in Python. Under active development.
shobdohash
Bengali Soundex (Phonetic Similarity Algorithm) Implementation
spacy-models
vocab-data
Raw vocabulary, word-lists and related scripts
BanglaKit's Repositories
banglakit/awesome-bangla
A collection of tools, datasets and resources on Bangla computing
banglakit/corpus-builder
Toolkit for compiling corpora from various sources
banglakit/shobdohash
Bengali Soundex (Phonetic Similarity Algorithm) Implementation
banglakit/bondhon
Bondhon, Bengali for "Connection", is a Python module for converting between popular modern and legacy Bengali character encodings.
banglakit/bengali-stemmer
A library of implementations of published stemming methods for the Bengali language.
banglakit/lemmatizer
A rule-based lemmatizer for Bengali (Bangla), written in Python. Under active development.
banglakit/spacy-models
banglakit/bangla-academy-sort
A library of functions in several programming languages for sorting text according to the standard sort order defined by the Bangla Academy (বাংলা একাডেমী).
banglakit/corpora-preparation
banglakit/vocab-data
Raw vocabulary, word-lists and related scripts
banglakit/number-to-bengali-word
A Python package to convert numbers to Bengali words.
banglakit/transliteration-data
Transliteration data for converting between romanized Bengali and Bengali script
banglakit/bengali-ner-data
Annotated dataset for training a named-entity recognition (NER) model for Bengali
banglakit/translit-rnn
Automatic transliteration with an LSTM
banglakit/bondhon-docx
Python module for converting Office Open XML (DOCX) files between legacy and modern Bengali character encodings.
banglakit/contributors-guide
Guidelines for new and existing contributors working with BanglaKit
banglakit/bengali-sbd
Simple Rule-Based Sentence Boundary Detection for Bengali
banglakit/.github
banglakit/banglakit.github.io
banglakit/TextRecognitionDataGenerator
A synthetic data generator for text recognition