gxrxrdx/tesseract-ocr

tr file Punjabi -FAILURE! Couldn't find a matching blob

Opened this issue · 1 comments

What steps will reproduce the problem?
1. Use JTessBoxEditor to generate box and tif files
2. Use following files attached with the following command
3. tesseract [lang].[fontname].exp[num].tif [lang].[fontname].exp[num] box.train
i.e.

tesseract pan.raavi.exp0.tif pan.raavi.exp0 box.train

APPLY_BOXES:
   Boxes read from boxfile:     275
   Boxes failed resegmentation:     174
   Found 101 good blobs.
   Leaving 13 unlabelled blobs in 0 words.
Generated training data for 2 words


What is the expected output? What do you see instead?
for their not to be couldn't find a matching blob error.


What version of the product are you using? On what operating system?
v3.02
Windows 8.1 

Please provide any additional information below.

Original issue reported on code.google.com by dalbirsi...@googlemail.com on 6 Feb 2015 at 10:36

Attachments:

APPLY_BOXES: boxfile line 18970/Ó¿ª ((2265,1626),(2273,1632)): FAILURE! 
Couldn't
 find a matching blob
FAIL!
APPLY_BOXES: boxfile line 19001/ ((170,1603),(175,1611)): FAILURE! 
Couldn't f
ind a matching blob
FAIL!
APPLY_BOXES: boxfile line 19047/Ó¿╣ ((505,1603),(511,1609)): FAILURE! 
Couldn't f
ind a matching blob
FAIL!
APPLY_BOXES: boxfile line 19064/- ((650,1606),(652,1607)): FAILURE! Couldn't fin
d a matching blob
APPLY_BOXES:
   Boxes read from boxfile:   19070
   Boxes failed resegmentation:    1029
APPLY_BOXES: Unlabelled word at :Bounding box=(1972,1876)->(1975,1879)
   Found 18041 good blobs.
   Leaving 23 unlabelled blobs in 0 words.
   1 remaining unlabelled words deleted.
TRAINING ... Font name = raavi
Generated training data for 295 words

Original comment by dalbirsi...@googlemail.com on 11 Feb 2015 at 6:34