PoC of OCR model based on Transformer architecture (Attention is all you need)
References:
nlp.seas.harvard.edu/2018/04/03/attention.html
jalammar.github.io/illustrated-transformer/
github.com/facebookresearch/detr
github.com/yunjey/pytorch-tutorial/tree/master/tutorials/03-advanced/image_captioning