The eng.traineddata provided by default cannot be used for OSD
piyushspss opened this issue · 3 comments
piyushspss commented
Current Behavior: The default eng.traineddata is just 4017 kb and osd is not working for this.
Expected Behavior: The actual eng.traineddata from tesseract github is 23956kb and OSD is working as expected.
Suggested Fix: Please update the file.
stweil commented
The provided file is not faulty, but simply a model made for fast OCR, so fits for most users.
If you want to do OSD, either don't specify a language or - if you have to specify a language - get a traineddata file from https://github.com/tesseract-ocr/tessdata/.
piyushspss commented
stweil commented
Indeed. Then you have to get the right eng.traineddata.