/captcha-decoder

Using tesseract, imagemagick, python, pytesseract, pillow, PIL(python image processing)

Primary LanguagePython

OCR Image Processing

Read out the captcha

Author: Lin Dong

Date: July 22nd, 2016

Prerequisites

OS: Mac

brew install imagemagick
brew install tesseract --all-languages

brew install python2
pip install Pillow
pip install pytesseract

Instructions

Just run: python main.py

  • iteration.py: iterate multiple times of original image
  • convert_to_text.py: read out the string from white-gray image,

Screenshots

Progressive Iterations:

Iteration 1:

Iteration 2:

Iteration 3:

References

  1. python 识别验证码
  2. OCR on OS X with tesseract Raw