END-TO-END pipeline for text detection and recognition

Original projects:

  • Text detection - EAST
  • Text recognition - CRNN

Download

Download pretrained models:

  1. Frozen TensorFlow EAST model from here [97M]
  2. Pretrained CRNN English model from here [34M]

and put them in the pretrained_models folder

Download datasets:

  • [optional] Small train - here [529M]
  • Validation - here [118M]

and put them in the data folder
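
Before running anything, it can help to confirm that both folders are in place. The snippet below is a minimal sketch, not part of the repository: the helper name `missing_dirs` is hypothetical, and treating an empty folder as "missing" is an assumption. Only the folder names pretrained_models and data come from the steps above.

```python
from pathlib import Path

# Folder names taken from the download steps; everything else is illustrative.
REQUIRED_DIRS = ("pretrained_models", "data")


def missing_dirs(root="."):
    """Return the required folders that are absent or empty under root."""
    missing = []
    for name in REQUIRED_DIRS:
        folder = Path(root) / name
        # Assumption: a folder with no entries means the download step was skipped.
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(name)
    return missing


if __name__ == "__main__":
    problems = missing_dirs()
    if problems:
        print("Missing or empty folders:", ", ".join(problems))
    else:
        print("Folder layout looks ready.")
```

Run it from the repository root. It checks only that the folders exist and contain something, not that the files inside are the expected models or datasets.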

Run

  • Test image - run run_demo_server.py and open http://0.0.0.0:8769/ in a browser
  • Validation - run validation.py

                  Baseline                     Benchmark
                  (English pretrained model)   (25 epochs on 80K SynthText)
  Char precision  0.1569                       0.3218
  Word precision  0.1017                       0.1175
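
The README does not define the two metrics, so the following is only one plausible reading, sketched under stated assumptions: word precision as the share of predicted words that match the ground truth exactly, and char precision as the share of predicted characters that match the ground-truth character at the same position. The function names are hypothetical and validation.py may compute these values differently.

```python
def char_precision(predictions, targets):
    """Share of predicted characters equal to the target character at the same index.

    Assumed definition; characters predicted beyond the target length count as wrong.
    """
    correct = 0
    total = 0
    for pred, target in zip(predictions, targets):
        correct += sum(p == t for p, t in zip(pred, target))
        total += len(pred)
    return correct / total if total else 0.0


def word_precision(predictions, targets):
    """Share of predicted words that match their target exactly (assumed definition)."""
    pairs = list(zip(predictions, targets))
    if not pairs:
        return 0.0
    return sum(pred == target for pred, target in pairs) / len(pairs)


if __name__ == "__main__":
    predicted = ["cat", "dog", "bird"]
    expected = ["cat", "dig", "bard"]
    print("char precision:", char_precision(predicted, expected))
    print("word precision:", word_precision(predicted, expected))
```

Under this reading a word counts only when every character is right, so word precision is normally the lower of the two, which agrees with the ordering in the table.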

Useful links

CRNN

Generate synthetic dataset

Other OCR Links

Papers/repositories/tools about text detection and recognition: