Beginner's guide to getting started with OCR (macOS Sierra 10.12.x)
- Eclipse Java EE IDE for Web Developers. Version: Neon.2 Release (4.6.2)
- macOS Sierra 10.12.6
- Maven
sudo port install tesseract   # using MacPorts
brew install tesseract        # or using Homebrew
- Download a sample image to pass to tesseract (the image should be noise-free, with black text on a white background)
Use the sample image provided with this example
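
If your own image is not already clean black-on-white, one possible preprocessing step is to binarize it before running OCR. The following is only a minimal sketch using the JDK's `BufferedImage`; the class name `Binarizer` and the fixed threshold are illustrative assumptions, not part of this project, and a fixed threshold will not remove heavy noise.

```java
import java.awt.image.BufferedImage;

// Hypothetical helper, not part of this repository.
final class Binarizer {
    // Converts an image to pure black and white.
    // Pixels darker than `threshold` (0-255 luminance) become black, all others white.
    static BufferedImage binarize(BufferedImage src, int threshold) {
        BufferedImage out = new BufferedImage(
                src.getWidth(), src.getHeight(), BufferedImage.TYPE_BYTE_BINARY);
        for (int y = 0; y < src.getHeight(); y++) {
            for (int x = 0; x < src.getWidth(); x++) {
                int rgb = src.getRGB(x, y);
                int r = (rgb >> 16) & 0xFF;
                int g = (rgb >> 8) & 0xFF;
                int b = rgb & 0xFF;
                // Integer approximation of perceived brightness.
                int luma = (299 * r + 587 * g + 114 * b) / 1000;
                out.setRGB(x, y, luma < threshold ? 0xFF000000 : 0xFFFFFFFF);
            }
        }
        return out;
    }
}
```

A threshold around 128 is a reasonable starting point for dark text on a light page; the result can be saved with `javax.imageio.ImageIO.write` and then passed to tesseract.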
`tesseract $imageName $outputFile`
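
The same command can also be driven from Java, which is a quick way to confirm the installation works before wiring up the Maven project. This is a rough sketch using only the JDK; the class `TesseractCli` and its method names are made up for illustration, and it assumes `tesseract` is on the `PATH` and writes its text output to `<outputFile>.txt`.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of this repository.
final class TesseractCli {
    // Builds the same command line as: tesseract $imageName $outputFile
    static List<String> buildCommand(String imageName, String outputBase) {
        return Arrays.asList("tesseract", imageName, outputBase);
    }

    // Runs tesseract and returns the recognized text read from <outputBase>.txt.
    static String recognize(String imageName, String outputBase)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(imageName, outputBase))
                .inheritIO() // show tesseract's own messages in the console
                .start();
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("tesseract exited with code " + exitCode);
        }
        Path textFile = Paths.get(outputBase + ".txt");
        return new String(Files.readAllBytes(textFile), StandardCharsets.UTF_8);
    }
}
```

For example, `TesseractCli.recognize("sample.png", "out")` would run the CLI on a file named `sample.png` (a placeholder name) and return the contents of `out.txt`.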
- Create a Maven project in Eclipse
- Copy the dependencies from this repository's pom.xml into your project's pom.xml and run a Maven build
- Build goals: clean verify -U
- Refer to App.java for usage
Clone this repository and import it into Eclipse
- If you rename the tessdata folder in the project, update the path in InitTess.java: tesseract.setDatapath("{folder.name}");
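
A wrong data path is an easy mistake to make after renaming the folder, so it can help to check the folder before passing it to `setDatapath`. The sketch below uses only the JDK; `TessdataCheck` is a hypothetical helper that is not in this repository, and it assumes the folder holds language files named like `eng.traineddata`.

```java
import java.io.File;

// Hypothetical helper, not part of this repository.
final class TessdataCheck {
    // Returns true if `folder` is a directory containing at least one *.traineddata file.
    static boolean hasTrainedData(File folder) {
        if (!folder.isDirectory()) {
            return false;
        }
        String[] names = folder.list((dir, name) -> name.endsWith(".traineddata"));
        return names != null && names.length > 0;
    }
}
```

In InitTess.java this could guard the existing call, for example by throwing an `IllegalStateException` when `TessdataCheck.hasTrainedData(new File("{folder.name}"))` returns false, so the failure points at the folder name rather than surfacing later during recognition.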