OCR parser intended for registering payments
- ImageMagick: cleans up images to make OCR parsing easier.
- Tesseract: performs the actual OCR parsing.
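The division of labor between the two tools can be sketched in Ruby as follows. This is a minimal sketch, not the project's actual script: the helper names `preprocess_command` and `ocr_command`, the file paths, and the particular ImageMagick options (grayscale, 200% resize) are assumptions chosen for illustration. It only builds the command lines, so it runs even when the tools are not installed.

```ruby
# Hypothetical helpers: build the argument lists for the two external tools.
# Nothing is executed here; the commands are only printed.

# ImageMagick: convert the image to grayscale and upscale it, which often
# helps OCR on small or noisy text. These options are illustrative assumptions.
def preprocess_command(input_path, cleaned_path)
  ["convert", input_path, "-colorspace", "Gray", "-resize", "200%", cleaned_path]
end

# Tesseract: `tesseract IMAGE OUTPUTBASE` writes the recognized text
# to OUTPUTBASE.txt.
def ocr_command(cleaned_path, output_base)
  ["tesseract", cleaned_path, output_base]
end

commands = [
  preprocess_command("images/receipt.png", "images/receipt_clean.png"),
  ocr_command("images/receipt_clean.png", "outputs/receipt")
]

commands.each { |cmd| puts cmd.join(" ") }
# To actually run them, pass each list to Kernel#system, e.g. system(*cmd).
```

On ImageMagick 7 the preferred entry point is `magick` rather than `convert`, so the first element of the preprocessing command may need adjusting.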
- Install ImageMagick:
brew install imagemagick # Or apt-get, or whatever.
- Install Tesseract:
brew install tesseract # Or apt-get, or whatever.
$ ./folder.sh
$ gem install
Place the image to convert inside the images folder, then run:
$ ruby script.rb <input_image_name>
This writes a file to the outputs folder whose name contains the input image name.
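One way the output file name could be derived from the input image name is sketched below. The helper `output_path_for`, the `.txt` extension, and the exact naming scheme are assumptions; the actual script may name its files differently.

```ruby
# Hypothetical helper: map an input image name to a file in the outputs folder.
# Assumes the output keeps the image's base name and gets a .txt extension.
def output_path_for(input_image_name, output_dir = "outputs")
  base = File.basename(input_image_name, ".*") # strip directory and extension
  File.join(output_dir, "#{base}.txt")
end

puts output_path_for("receipt.png")           # prints outputs/receipt.txt
puts output_path_for("images/invoice-01.jpg") # prints outputs/invoice-01.txt
```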
- Actually return formatted data, even with different pictures
- Convert to a Gem
- Testing!
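As a possible starting point for the first and last items, here is a minimal sketch of turning raw OCR text into structured data, with a couple of plain assertions standing in for tests. The function `extract_payment`, the field names, the patterns, and the sample text are all hypothetical; real payment images will need patterns tuned to their layout.

```ruby
# Hypothetical parser: pull an amount and a date out of raw OCR text.
# The patterns assume amounts like "1,234.56" and dates like "2016-03-14".
AMOUNT_PATTERN = /(\d+(?:,\d{3})*\.\d{2})/
DATE_PATTERN   = /(\d{4}-\d{2}-\d{2})/

def extract_payment(ocr_text)
  amount = ocr_text[AMOUNT_PATTERN, 1] # first capture group, or nil
  date   = ocr_text[DATE_PATTERN, 1]
  {
    amount: amount && amount.delete(",").to_f, # drop thousands separators
    date: date
  }
end

sample = "TRANSFER RECEIPT\nDate: 2016-03-14\nAmount: $1,234.56\n"
result = extract_payment(sample)

# Plain assertions in place of a test framework.
raise "amount mismatch" unless result[:amount] == 1234.56
raise "date mismatch" unless result[:date] == "2016-03-14"
p result
```

Fields that cannot be found come back as `nil`, which lets the caller decide whether a partially recognized image is usable.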