Updated on March 14, 2019 (Updated table ranking with evaluation protocol info.)
Updated on November 26, 2018 (Table ranking is included for reference.)
Updated on August 24, 2018 (Newly added annotation tool folder.)
Updated on May 15, 2018 (Added groundtruth in '.txt' format.)
Updated on May 14, 2018 (Added feature - 'Do not care' candidates filtering is now available in the latest python scripts.)
Updated on April 03, 2018 (Added pixel level groundtruth)
Updated on November 04, 2017 (Added text level groundtruth)
Released on October 27, 2017
- The results from recent papers on the Total-Text dataset are listed.
Method | Precision (%) | Recall (%) | F-measure (%) | Published at |
---|---|---|---|---|
FTSN [paper] (*Pascal VOC IoU metric) | 84.7* | 78.0* | 81.3* | ICPR2018 |
TextField [paper] | 81.2 | 79.9 | 80.6 | TIP2019 |
CSE [paper] (^Polygon Regression) | 81.4(80.9^) | 79.7(80.3^) | 80.2(80.6^) | CVPR2019 |
MSR [paper] | 85.2 | 73.0 | 78.6 | arXiv:1901.02596 |
TextSnake [paper] | 82.7 | 74.5 | 78.4 | ECCV2018 |
CTD [paper] | 74.0 | 71.0 | 73.0 | PR2019 |
TextNet [paper] | 68.2 | 59.5 | 63.5 | ACCV2018 |
Mask TextSpotter [paper] | 69.0 | 55.0 | 61.3 | ECCV2018 |
CENet [paper] | 59.9 | 54.4 | 57.0 | ACCV2018 |
Textboxes [paper] | 62.1 | 45.5 | 52.5 | AAAI2017 |
EAST [paper] | 50.0 | 36.2 | 42.0 | CVPR2017 |
Baseline [paper] | 33.0 | 40.0 | 36.0 | ICDAR2017 |
SegLink [paper] | 30.3 | 23.8 | 26.7 | CVPR2017 |
End-to-end Recognition (None refers to recognition without any lexicon; Full lexicon contains all words in test set.)
Method | None (%) | Full (%) | Published at |
---|---|---|---|
CNN+CTC [paper] | 77.5 | - | Neurocomputing2019 |
TextNet [paper] | 54.0 | - | ACCV2018 |
Mask TextSpotter [paper] | 52.9 | 71.8 | ECCV2018 |
Textboxes [paper] | 36.3 | 48.9 | AAAI2017 |
(Note that these results are extracted from respective published papers. If your result is missing or incorrect, please do not hesisate to contact us.)
In order to facilitate a new text detection research, we introduce the Total-Text dataset (ICDAR-17 paper) (presentation slides), which is more comprehensive than the existing text datasets. The Total-Text consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
If you find this dataset useful for your research, please cite
@inproceedings{CK2017,
author = {Chee Kheng Ch’ng and
Chee Seng Chan},
title = {Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition},
booktitle = {14th IAPR International Conference on Document Analysis and Recognition {ICDAR}},
pages = {935--942},
year = {2017},
doi = {10.1109/ICDAR.2017.157},
}
Suggestions and opinions of this dataset (both positive and negative) are greatly welcome. Please contact the authors by sending email to
chngcheekheng at gmail.com
or cs.chan at um.edu.my
.
The project is open source under BSD-3 license (see the LICENSE
file). Codes can be used freely only for academic purpose.
Copyright 2018, Center of Image and Signal Processing, Faculty of Computer Science and Information Technology, University of Malaya.