Issues
- 22
Support Python 3
#39 opened by madalu - 0
- 2
adopt IETF language tags (BCP 47)
#33 opened by jwilk - 0
- 0
gocr: "trying pxH-fix by Hxp …"
#45 opened by jwilk - 0
- 1
Multiprocessing support
#43 opened by FriedrichFroebel - 0
Unclear origin of OCR engine messages when using -j
#42 opened by jwilk - 5
Keep maintaining the package in Debian and get the program back to the distribution
#38 opened by jsbien - 3
-X extra_args='-psm 1' option
#40 opened by derrikF - 3
- 0
DjVu to PAGE-XML converter
#37 opened by jwilk - 18
Multiple jobs do not work with Tesseract 4
#31 opened by ashipunov - 4
- 2
- 1
allow passing arbitrary options to Tesseract
#30 opened by jsbien - 2
Tesseract: 3.02: Malformed hOCR document: character zones intermixed with non-character zones
#8 opened by jwilk - 0
Windows support
#35 opened by jwilk - 1
- 1
- 3
- 0
adopt hOCR utilities from marasca
#34 opened by jwilk - 1
TSV support (tsv2djvused)
#28 opened by jsbien - 6
ocrodjvu for tesseract 3.04.00
#14 opened by jwilk - 1
Allow editing hOCR (or TSV) files
#19 opened by jwilk - 2
quneiform support
#24 opened by jwilk - 0
parallel mode for djvu2hocr
#25 opened by jwilk - 3
Non-ASCII filenames cause UnicodeEncodeError
#23 opened by derrikF - 0
djvu2hocr: extract XMP metadata
#22 opened by jwilk - 4
- 7
Crash with empty page
#7 opened by jwilk - 3
ValueError: need more than 0 values to unpack
#18 opened by jwilk - 10
Fix & document exit codes
#6 opened by jwilk - 5
djvused script without escaping Unicode characters
#13 opened by jwilk - 1
Support ocropus 0.6
#2 opened by jwilk - 0
Support for UZN files?
#17 opened by jwilk - 9
- 3
ocrodjvu hangs with DjVuLibre 3.5.26
#16 opened by jwilk - 5
ocrodjvu creates an incorrect djvused script?
#12 opened by jwilk - 6
Version 0.7.18 does not start
#11 opened by jwilk - 6
tesseract engine (v 3.03) not found
#9 opened by jwilk - 3
freeze if a page cannot be decoded
#5 opened by jwilk - 3
crashes on non-UTF-8 file identifiers
#4 opened by jwilk - 2
Support multi-languages with Tesseract
#3 opened by jwilk - 3
process multiple html files with hocr2djvused
#1 opened by jwilk