Pinned Repositories
FudanOCR
A toolbox of scene text super-resolution and recognition
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
myvqa
The implementation of CLVIN, CAAN, and MPCCT
TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
RUArt
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
mcan-bert
VQACL
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
erjpc's Repositories
erjpc/FudanOCR
A toolbox of scene text super-resolution and recognition