Tess4J

Join the chat at https://gitter.im/nguyenq/tess4j

A Java JNA wrapper for the Tesseract OCR API.

Tess4J is released and distributed under the Apache License, v2.0.

Features

The library provides optical character recognition (OCR) support for:

  • TIFF, JPEG, GIF, PNG, and BMP image formats
  • Multi-page TIFF images
  • PDF document format
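
As a rough illustration of the format list above, the sketch below filters input files by extension before handing them to an OCR engine. It is not part of Tess4J: the class `OcrInputFilter`, its methods, and the mapping from formats to file extensions are all hypothetical, and the engine is passed in as a plain `Function<File, String>` so the example runs without the library on the classpath.

```java
import java.io.File;
import java.util.Locale;
import java.util.Optional;
import java.util.Set;
import java.util.function.Function;

// Hypothetical helper, not a Tess4J class.
class OcrInputFilter {

    // Assumed extensions for the formats named in the feature list
    // (TIFF, JPEG, GIF, PNG, BMP, PDF); the README lists formats, not extensions.
    static final Set<String> SUPPORTED =
            Set.of("tif", "tiff", "jpg", "jpeg", "gif", "png", "bmp", "pdf");

    // Returns true when the file name ends in one of the assumed extensions.
    static boolean isSupported(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return false; // no extension at all, or a trailing dot
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return SUPPORTED.contains(ext);
    }

    // Runs the supplied engine only for supported files; otherwise returns empty.
    // With Tess4J available, the engine could wrap a call such as
    // new Tesseract().doOCR(file), which throws the checked TesseractException.
    static Optional<String> recognize(File file, Function<File, String> engine) {
        if (!isSupported(file.getName())) {
            return Optional.empty();
        }
        return Optional.ofNullable(engine.apply(file));
    }

    public static void main(String[] args) {
        // Stub engine standing in for a real OCR call.
        Function<File, String> stub = f -> "text from " + f.getName();
        System.out.println(recognize(new File("scan.TIFF"), stub).orElse("(skipped)"));
        System.out.println(recognize(new File("notes.docx"), stub).orElse("(skipped)"));
    }
}
```

Keeping the engine behind a functional interface is only a convenience for this sketch; in a real project the check could sit directly in front of the Tess4J call.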

Dependencies

On Windows: Microsoft Visual C++ 2022 x86 and x64 Runtimes

The Tesseract and Leptonica Windows binaries were built with the Visual Studio 2022 (v143) Platform Toolset, so make sure the Microsoft Visual C++ 2022 Redistributable is installed.

Tutorial

Development with Tess4J in NetBeans, Eclipse, and Command-line

Contributors

A big thanks to GitHub and all of Tess4J's contributors.