TeluguOCR + Chamanti OCR
zuphilip opened this issue · 2 comments
Banti Telugu OCR: https://github.com/TeluguOCR/banti_telugu_ocr
"This framework relies on the ability of a segmentation algorithm to break the text in to glyphs."
Chamanti OCR: https://github.com/rakeshvar/chamanti_ocr
"It will not rely on segmentation algorithms (at the glyph level), making it ideal for highly agglutinative scripts like Arabic, Devanagari etc. We will be starting with Telugu however."
It is hard to guess for me, how good the recognition work, because I don't understand Telugu and I haven't found any results, discussions, blog posts (which I can read). But the project looks IMO very promising from a technical point. CC @rakeshvar
Here is a paper on banti. http://arxiv.org/abs/1509.05962
I will update it to the latest version soon, that will have comparison with Google's
@ChillarAnand
@rakeshvar Thank you for the link to the paper!