Book Title Recognition

Pipeline

img1


Text detection

for text detection we used EAST (An efficient and accurate scene text detector) Figure_2


Image Processing

we simply merge the bounding boxes into a larger one most probably it will be the title Figure_3


Text Detection

we apply only the title image to EAST Figure_1


Text Recognition

we use tesseract for text recognition for thr image Figure_1

The output text:
(THIRD EDITION Textbook of Geotechnical Enoineering)


Text Processing

we applied some simple text processing methods:
-removing punctuation.
-removing single characters from the list of results.
-removing duplication.