Kowsher/Bangla-NLP

Error lemmatizing

Closed this issue · 3 comments

Today I was exploring your lemmatizer on some random raw texts. Some nouns returned wrong lemma which is somewhat acceptable like "শাকিব" becomes "শাক". But the error happens with the word " ববি". It throws error. Check it out as it is proper noun and use as a name.

Well, the problem was relatively trivial. Inside your trie.py file, you search the dictionary char_map directly. Fix it with char_map.get() method. Since this is the only error till now, I am closing this issue.

Basically, we worked on Bangla word not with names. Besides, we have a shortage of Bangla root word, if we include humans Bangla name as well as increase Bangla word it works perfectly. Since the algorithm is trained based on the label