textvqa

There are 4 repositories under textvqa topic.

facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python5.6k 110 657940
yashkant/sam-textvqa
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
Language:Python64 3 2013
phiyodr/vqaloader
PyTorch DataLoader for many VQA datasets
Language:Python11 1 01
soonchangAI/LFPR
[PRL 2024] This is the code repo for our label-free pruning and retraining technique for autoregressive Text-VQA Transformers (TAP, TAP†).
Language:Python2 2 00