Capture2Text
Linux CLI port of Capture2Text v4.5.1 (Ubuntu)
The OCR results from Capture2Text were generally better than standard Tesseract, so it seemed ideal to make this run on Linux. It turns out the console version can actually run without anything more than build changes.
Dependencies
Tesseract, Qt 5 (cannot be version 4), Leptonica
sudo apt-get install -y libtesseract-dev qtbase5-dev libleptonica-dev
Build instructions
QT_SELECT=5 qmake Capture2Text/Capture2Text.pro -d
make
wget "https://github.com/GSam/Capture2Text/releases/download/Prototype/English.zip"
unzip -o "English.zip" -d "tessdata"
rm "English.zip"
TODO
- Make the build file more generic, so as to be able to build on Windows still
- Add some additional instructions on packages to install and how to build
- Figure out how to avoid
-platform offscreenfor the command line (over SSH)