/jTessBoxEditor

Box editor and trainer for Tesseract OCR

Primary LanguageHTML

jTessBoxEditor

A box editor and trainer for Tesseract OCR, providing editing of box data of both Tesseract 2.0x and 3.0x formats and full automation of Tesseract training. It can read images of common image formats, including multi-page TIFF. The program requires Java Runtime Environment 8 or later.

Note: LSTM Training for Tesseract 4.0x is not supported.

jTessBoxEditor is released and distributed under the Apache License, v2.0.

Features

  • Tesseract Windows training executable 5.3.3 bundled

System requirements

Java.

Command line

java -Xms128m -Xmx1024m -jar jTessBoxEditor.jar