Pinned Repositories
CNMT
code for Confidence-aware Non-repetitive Multimodal Transformers for TextCaps (AAAI 2021)
GenerativeImage2Text
GIT: A Generative Image-to-text Transformer for Vision and Language
GRCF
MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
shmizhang.github.io
xovee.github.io
Home page of Xovee Xu.
shmizhang's Repositories
shmizhang/GRCF
shmizhang/CNMT
code for Confidence-aware Non-repetitive Multimodal Transformers for TextCaps (AAAI 2021)
shmizhang/GenerativeImage2Text
GIT: A Generative Image-to-text Transformer for Vision and Language
shmizhang/MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
shmizhang/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
shmizhang/shmizhang.github.io
shmizhang/xovee.github.io
Home page of Xovee Xu.