OCR has tendency to misread the '<' between first and middle names.
Nimbus76 opened this issue · 3 comments
Nimbus76 commented
read_mrz()/tesseract tends to interpret the '<' between first and middle name as a 'K'
I have tried multiple scans of varying quality of several passports and this anomaly occurs more often than not. Sometimes, it also interprets the '<' as an "X".
Every other field has been reliable.
canklot commented
Are you using the legacy mode with tesseract?
RanaOsamaAsif commented
Facing the same issue with names, is there any way to fix/improve this behavior?
konstantint commented
@RanaOsamaAsif Try both the legacy and new Tesseract models. In my experience the legacy model was more robust with respect to this particular issue.