gxrxrdx/tesseract-ocr

Tesseract 3.02 not working with .tiff files out of the box on Windows.

Closed this issue · 4 comments

What steps will reproduce the problem?
1. Download and install tesseract-ocr-setup-3.02.02.exe 
   (mark developer tools during installation) 

2. try to OCR a tiff file, e.g.:
$ tesseract converted0.tiff ocr_converted0tiff -psm 3

What do you see instead?
Tesseract Open Source OCR Engine v3.02 with Leptonica
Unsupported image type.

What version of the product are you using? On what operating system?
Tesseract v3.02 with Leptonica
Windows 7 64-bit

Please provide any additional information below.
(during the installation of the .exe I received a 404 file not found message, 
it didn't show which file wasn't found though)

Original issue reported on code.google.com by brucardo...@gmail.com on 26 Jan 2015 at 5:43

you forgot to mention the language?!

$ tesseract converted0.tiff ocr_converted0tiff -l lang  -psm 3

Original comment by sivakuma...@gmail.com on 30 Jan 2015 at 8:41

If it was not specified, It should be using the default one (eng).

Original comment by brucardo...@gmail.com on 30 Jan 2015 at 11:38

1. Tesseract works with .tiff files out of box on Windows. But there are 
different tiff files (e.g. they can use different compression)
2. You forget to attach converted0.tiff

Original comment by zde...@gmail.com on 6 Feb 2015 at 11:07

Original comment by zde...@gmail.com on 12 Apr 2015 at 3:41

  • Changed state: WontFix