/ocr

Optical character recognition is the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image.

Prerequisites to run this program:

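1. Download and install Tesseract from https://github.com/tesseract-ocr/tesseract/wiki/Downloads
2. Install the Python packages from the command line: "pip install opencv-python" (the package that provides the cv2 module; there is no installable package named cv2), "pip install pytesseract" and "pip install Pillow" (the package that provides PIL).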
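Before running the program, it can help to confirm that the prerequisites are actually visible to Python. The snippet below is a minimal sketch, not part of the original project: the function name `check_prerequisites` is hypothetical, and it assumes the `tesseract` executable is expected on the PATH, which is where pytesseract looks by default.

```python
import importlib.util
import shutil

# Import names of the required Python packages
# (cv2 comes from opencv-python, PIL comes from Pillow).
REQUIRED_MODULES = ("cv2", "pytesseract", "PIL")


def check_prerequisites():
    """Return a dict mapping each prerequisite name to True if it was found."""
    status = {
        name: importlib.util.find_spec(name) is not None
        for name in REQUIRED_MODULES
    }
    # pytesseract calls the external `tesseract` program, so by default
    # that program has to be reachable through the PATH.
    status["tesseract"] = shutil.which("tesseract") is not None
    return status


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

If `tesseract` is reported as missing on Windows even though it is installed, the install folder has probably not been added to the PATH.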

Running the Code

1. Copy your images to the 'images' folder.
2. Open a Command Prompt in the project folder (Python scripts need no separate compile step).
3. Run the script with the image name, e.g. "python prob.py --image images/image.jpg"
4. A window pops up showing the noise-reduced image.
5. The text found in the image is also printed.
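The steps above can be sketched roughly as follows. This is an illustration under stated assumptions rather than the actual contents of prob.py: the function names are hypothetical, and the grayscale conversion plus median blur is only one common way to reduce noise, so the original script may use a different method.

```python
import argparse


def parse_args(argv=None):
    """Parse the --image option used in the run command."""
    parser = argparse.ArgumentParser(description="OCR an image with Tesseract")
    parser.add_argument("--image", required=True, help="path to the input image")
    return parser.parse_args(argv)


def run_ocr(image_path):
    """Return (denoised image, recognized text) for the given image file."""
    # Imported here so argument parsing still works if these are not installed.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        # cv2.imread returns None instead of raising when the file is unreadable.
        raise FileNotFoundError(f"could not read image: {image_path}")

    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Assumption: a 3x3 median blur stands in for the noise removal step.
    denoised = cv2.medianBlur(gray, 3)
    text = pytesseract.image_to_string(denoised)
    return denoised, text


def main(argv=None):
    import cv2

    args = parse_args(argv)
    denoised, text = run_ocr(args.image)
    print(text)
    # Show the noise-reduced image until a key is pressed.
    cv2.imshow("Noise-reduced image", denoised)
    cv2.waitKey(0)
    cv2.destroyAllWindows()
```

Calling `main()` at the bottom of the file would let the sketch be run the same way as the command in step 3.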