TEXT-RECOGNITION-USING-PYTHON
Text recognition from images is an active research area that aims to build applications able to read text from images automatically. This project is based on the same idea. Its main features are scene text detection, document scanning, and 2D image scanning.
Language used: Python
Modules used: OpenCV, pytesseract, EasyOCR, Tkinter
Dependencies
Python-tesseract (pytesseract) is a wrapper for Google’s Tesseract-OCR Engine (https://github.com/tesseract-ocr/tesseract). It can read all image types supported by the Pillow and Leptonica imaging libraries, including JPEG, PNG, GIF, BMP, TIFF, and others. The text extracted from an image can also be written to another file.
pip install pytesseract
Set the path to tesseract.exe in the code through the tesseract_cmd setting.
In my case it is: p.pytesseract.tesseract_cmd=r'C:\Users\hp\AppData\Local\Tesseract-OCR\tesseract.exe'
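The path setup can be made a little more portable by checking a user-supplied location first and falling back to the system PATH. The following is a minimal sketch under that assumption, not the code used in runner.py; the helper names find_tesseract and ocr_image are hypothetical, and pytesseract and Pillow are imported lazily so the path lookup works without them.

```python
import os
import shutil


def find_tesseract(explicit_path=None):
    """Return a usable path to the Tesseract executable, or None.

    explicit_path is a hypothetical user-supplied location (for example the
    Windows install path). If it is not an existing file, fall back to PATH.
    """
    if explicit_path and os.path.isfile(explicit_path):
        return explicit_path
    # shutil.which returns None when no "tesseract" executable is on PATH.
    return shutil.which("tesseract")


def ocr_image(image_path, tesseract_path=None, lang="eng"):
    """Run Tesseract on one image file and return the recognized text."""
    # Imported here so find_tesseract can be used without these packages.
    import pytesseract
    from PIL import Image

    cmd = find_tesseract(tesseract_path)
    if cmd is None:
        raise FileNotFoundError("Tesseract executable not found")
    # Tell pytesseract which executable to call.
    pytesseract.pytesseract.tesseract_cmd = cmd
    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img, lang=lang)
```

Calling ocr_image with the image path and the tesseract.exe location from your own machine returns the recognized text as a string.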
On Windows, install torch and torchvision first by following the official instructions at https://pytorch.org. On the PyTorch website, be sure to select the CUDA version you have installed. If you intend to run in CPU-only mode, select CUDA = None.
pip install easyocr
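Once EasyOCR is installed, reading text from an image takes a Reader object and one readtext call. The sketch below assumes English text and CPU mode; the helper names filter_detections and read_scene_text and the 0.5 confidence threshold are illustrative choices, not taken from this project.

```python
def filter_detections(detections, min_confidence=0.5):
    """Keep the text of detections at or above min_confidence.

    Each detection is a (bounding_box, text, confidence) tuple, the shape
    easyocr.Reader.readtext returns by default.
    """
    return [text for _box, text, conf in detections if conf >= min_confidence]


def read_scene_text(image_path, languages=("en",), use_gpu=False):
    """Detect and recognize text in an image with EasyOCR."""
    # Imported here so filter_detections can be used without easyocr.
    import easyocr

    # gpu=False matches the CPU-only setup (CUDA = None) described above.
    reader = easyocr.Reader(list(languages), gpu=use_gpu)
    return filter_detections(reader.readtext(image_path))
```

Filtering by confidence is one simple way to drop spurious detections, which tend to appear in cluttered scene images; the right threshold depends on the images.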
PIL is the Python Imaging Library, which adds image processing capabilities to your Python interpreter. It is installed through its maintained fork, Pillow.
pip install Pillow
Deployment
To deploy this project, first clone the repository with git or download the zip file. Then change to the downloaded directory and run the runner.py file.
python runner.py
After the Tkinter UI loads, select the option that best describes the image type:
- Scene detection (for license plate recognition, sign boards, name plates)
- Document scanner (for images of PDF-type documents)
- 2D image scanner (for skewed images)
Then select the image from the image browser and press Enter. This loads the image and shows the results.
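For skewed images, one common approach is to warp the page to a flat rectangle before running OCR. The sketch below illustrates that idea with OpenCV's perspective transform; it is an assumption about how such a step could look, not necessarily what runner.py does. The function names, the default output size, and the requirement that the four page corners are already known are all hypothetical.

```python
def order_corners(points):
    """Order four (x, y) corners as top-left, top-right, bottom-right, bottom-left.

    Uses image coordinates, where y grows downward.
    """
    if len(points) != 4:
        raise ValueError("expected exactly four corner points")
    top_left = min(points, key=lambda p: p[0] + p[1])      # smallest x + y
    bottom_right = max(points, key=lambda p: p[0] + p[1])  # largest x + y
    top_right = min(points, key=lambda p: p[1] - p[0])     # smallest y - x
    bottom_left = max(points, key=lambda p: p[1] - p[0])   # largest y - x
    return [top_left, top_right, bottom_right, bottom_left]


def flatten_page(image, corners, width=800, height=1000):
    """Warp the quadrilateral given by corners to a width x height rectangle."""
    # Imported here so order_corners can be used without OpenCV or NumPy.
    import cv2
    import numpy as np

    src = np.array(order_corners(corners), dtype=np.float32)
    dst = np.array(
        [[0, 0], [width - 1, 0], [width - 1, height - 1], [0, height - 1]],
        dtype=np.float32,
    )
    matrix = cv2.getPerspectiveTransform(src, dst)
    return cv2.warpPerspective(image, matrix, (width, height))
```

The flattened image returned by flatten_page can then be passed to the OCR step in place of the original photo.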
Acknowledgements
Thanks to the YouTube channel of Nicholas Renotte
Thanks to the YouTube channel Murtaza's Workshop - Robotics and AI
Article on OCR by Nanonets