problem with arabic lines
zMuh opened this issue · 3 comments
If you need horizontal lines to be detected, you have to fine-tune CRAFT model by yourself.
https://github.com/JaidedAI/EasyOCR/tree/master/trainer/craft
Hi @romanvelichkin,
I think you didn't understand what the original reporter was trying to say.
Take a look at the second line in the text, for example. I mean the line after the short line. You can see that six blocks (I'm not familiar with OCR and don't know what they are called) are detected in a single line. Each of these blocks will appear as a separate line in the output.
To make matters worse, they are printed one by one, from left to right, in a text that is supposed to be written from right to left!
Hi @rezaies,
So he meant lines of text, I thought he meant that little line at the bottom of page.
To make the boxes more sparse, there are many settings that can be set for this before scanning.
To make matters worse, they are printed one by one, from left to right, in a text that is supposed to be written from right to left!
I think it is not that hard to write a parser for the scan results, given this situation.