The eng.traineddata provided by default cannot be used for OSD

Question

piyushspss opened this issue 4 years ago · 3 comments

Answer 1 · 2020-05-28T11:26:10.000Z

The provided file is not faulty; it is simply a model built for fast OCR, which suits most users.

If you want to do OSD, either don't specify a language or - if you have to specify a language - get a traineddata file from https://github.com/tesseract-ocr/tessdata/.
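A minimal sketch of the two options above, written in Python around the `tesseract` command line. It assumes the `tesseract` executable is on the PATH and that an `osd.traineddata` file is available in the tessdata directory. The helper names `build_osd_command` and `run_osd`, and any file or directory paths passed to them, are hypothetical and only serve as illustration.

```python
import subprocess


def build_osd_command(image_path, lang=None, tessdata_dir=None):
    """Build a tesseract command line for OSD only (page segmentation mode 0).

    lang and tessdata_dir are optional; by default no language is specified,
    which is the first option suggested above.
    """
    cmd = ["tesseract", image_path, "stdout"]
    if tessdata_dir is not None:
        # Point tesseract at a directory holding traineddata files
        # downloaded from https://github.com/tesseract-ocr/tessdata/
        cmd += ["--tessdata-dir", tessdata_dir]
    if lang is not None:
        # Only add -l when a language really has to be specified.
        cmd += ["-l", lang]
    cmd += ["--psm", "0"]  # 0 = orientation and script detection only
    return cmd


def run_osd(image_path, lang=None, tessdata_dir=None):
    """Run tesseract and return its raw OSD report as text."""
    result = subprocess.run(
        build_osd_command(image_path, lang, tessdata_dir),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract fails
    )
    return result.stdout
```

Calling `run_osd("scan.png")` would try OSD without a language, while `run_osd("scan.png", lang="eng", tessdata_dir="/path/to/tessdata")` would use a separately downloaded `eng.traineddata`. Both file names are placeholders.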

Answer 2 · 2020-05-28T11:51:48.000Z

Thanks for the reply, Stweil.

I tried OSD without specifying a language on passport images, and it did not give me correct results. Could you please try it once?

Answer 3 · 2020-05-28T12:22:23.000Z

Indeed. Then you have to get the right eng.traineddata.