The code organization is WIP at the moment. Will update the repo after it is ready to be checked in.

Signboard transliteration

A project to perform transliteration on commercial signboard images.

The project involves 3 stages:

The project contains 4 ipython notebooks written in PyTorch. The notebooks can be described as follows:

proj_station_detection_preprocess.ipynb: Used to correct bounding box annotations provided by the dataset. This is a preprocessing step before the object detection step.
proj_station_detection_od.ipynb: Used to finetune a Faster-RCNN model to detect text regions in Signboard images
project_station_td_preprocess.ipynb: Used to tokenize the text labels provided in the Text Recognition dataset. This is a preprocessing step before the Text Recognition step.
project_station_td_recognition.ipynb: Used to train a Text Recognition systems using an encoder decoder model inspired by the paper Show and Tell.
Text Transliteration: [WIP]

Author

Kushagra Pandey / @kpandey008