csebuetnlp/normalizer
This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.
Python
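To give a rough idea of what text normalization means for mixed Bengali and English input, here is a minimal sketch that uses only the Python standard library. It is not this module's API: the function name `simple_normalize` and its `form` parameter are hypothetical. It covers only Unicode normalization and whitespace cleanup, which is an assumption about the kind of cleaning involved, not a description of the paper's pipeline.

```python
import re
import unicodedata


def simple_normalize(text: str, form: str = "NFKC") -> str:
    """Hypothetical helper for illustration; not part of csebuetnlp/normalizer."""
    # Bring visually identical strings to a single code point sequence,
    # e.g. the Bengali vowel sign O written as two code points vs. one.
    text = unicodedata.normalize(form, text)
    # Collapse runs of whitespace (spaces, tabs, newlines) into one space.
    text = re.sub(r"\s+", " ", text)
    return text.strip()


# The same Bengali word typed with a decomposed and a precomposed vowel sign.
decomposed = "\u0995\u09c7\u09be\u09a8"   # ka + e-sign + aa-sign + na
precomposed = "\u0995\u09cb\u09a8"        # ka + o-sign + na

print(decomposed == precomposed)                                       # False
print(simple_normalize(decomposed) == simple_normalize(precomposed))   # True
```

Zero-width joiners and non-joiners are left untouched in this sketch because they can be meaningful in Bengali script.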