anoopkunchukuttan/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
PythonMIT
Issues
- 0
Replacement of semicolon by visarga for normalization in Telugu creates invalid/misformed words
#73 opened by ramSeraph - 0
Sentence tokenizer creating issue while splitting for end of the sentence.
#71 opened by varunkatiyar819 - 1
Better tokenization of numbers needed
#40 opened by anoopkunchukuttan - 0
Add more NLP models list
#67 opened by rajveer43 - 1
- 4
Making indic-nlp-library available via Conda Forge
#63 opened by Ubadub - 3
- 1
Is translate function available?
#48 opened by udaykumar1998 - 1
Inappropriate Hindi English Transliteration
#52 opened by Sonali210 - 2
- 1
- 2
Broken "Getting Started" links
#62 opened by Rhitabrat - 6
Transliteration not working
#35 opened by RaviTeja51 - 1
- 2
BrahmiNet is down
#57 opened by ma08 - 0
ImportError: No module named indicnlp.common
#55 opened by A-d-DASARE - 1
Make a kaggle dataset to use this library in the inferece of a kaggle competetion
#51 opened by I-am-sayantan - 6
Issue in Romanization
#38 opened by Sreelakshmi-k - 0
Schwa deletion in romanization for Hindi
#50 opened by anilkumar911 - 1
Undo wrong Moses tokenization
#36 opened by anoopkunchukuttan - 3
- 0
- 2
Placement of Anuswara
#43 opened by shantanuo - 4
Text Normalisation
#41 opened by ShubhamKumarNigam - 0
- 1
- 0
Preserve abbreviation punctuation for Tokenization & adding more abbreviations for Sentence Splitting
#30 opened by rhn19 - 3
vectors for SOS and EOS
#34 opened by samyakag - 4
Unable to do Machine Translation
#26 opened by aastha19 - 2
Detect the language of transliterated text
#33 opened by bnriiitb - 4
loaderload() fails in latest pandas
#32 opened by sayanb-7c6 - 3
- 10
Code normalization error for Malayalam
#7 opened by patelrajnath - 4
- 2
- 2
Normalizer Not working with other Options
#21 opened by bsaid5654 - 0
- 3
- 2
Can you publish this library on pip?
#22 opened by epicfaace - 4
Computing similarity between languages
#25 opened by VP007-py - 4
unable to use indic_nlp_library
#18 opened by riktimmondal - 9
Morphogical analyser
#13 opened by ashnamp - 0
Unit-testing
#16 opened by arcturusannamalai - 2
Tokenization failing for IITB Monolingual corpus
#15 opened by shantipriyap - 1
indic_tokenize
#14 opened by hlsrekha - 3
- 7
- 0
Script Conversion
#10 opened by anoopkunchukuttan - 0
Orthograhic syllabification
#9 opened by anoopkunchukuttan - 5
Introduction to the CLTK
#8 opened by kylepjohnson