erjpc

Pinned Repositories

FudanOCR
A toolbox of scene text super-resolution and recognition
Language:Python00
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python5.5k 114 657938
myvqa
The implementation of CLVIN、CAAN and MPCCT
Language:Python7 1 20
TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
Language:Python66 3 718
RUArt
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
Language:Python10 2 31
mcan-bert
Language:Python41
VQACL
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
Language:Python32 3 196

erjpc/FudanOCR
A toolbox of scene text super-resolution and recognition
Language:Python00