License: Apache License 2.0 (Apache-2.0)

ocr-open-dataset

A list of open datasets for optical character recognition (OCR).

printed

| Dataset | Year |
| --- | --- |
| Born-Digital Images (Web and Email) | 2011-2015 |
| COCO-Text | 2017 |
| Text Extraction from Biomedical Literature Figures | 2017 |
| Focused Scene Text | 2013-2015 |
| Text in Videos | 2013-2015 |
| Incidental Scene Text | 2015 |
| The Chars74K dataset | 2009 |
| The Uber Text dataset | 2017 |
| The Street View Text Dataset | 2012 |
| The Street View House Numbers (SVHN) Dataset | 2011 |

handwritten

| Dataset | Year |
| --- | --- |
| MNIST | 1998 |
| NIST Special Database 19 | 1995-2016 |
| The EMNIST Dataset | 2017 |
| IAM Handwriting Database | 1999-2002 |
| CASIA Online and Offline Chinese Handwriting Databases | 2007-2010 |
| CROHME: Competition on Recognition of Online Handwritten Mathematical Expressions | 2012-2013 |

mixed printed and handwritten

| Dataset | Year |
| --- | --- |
| ETL Character Database | 1973-1984 |