textvqa

There are 4 repositories under textvqa topic.

  • facebookresearch/mmf

    A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

    Language:Python5.5k114656934
  • yashkant/sam-textvqa

    Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.

    Language:Python6232013
  • phiyodr/vqaloader

    PyTorch DataLoader for many VQA datasets

    Language:Python8201
  • soonchangAI/LFPR

    [PRL 2024] This is the code repo for our label-free pruning and retraining technique for autoregressive Text-VQA Transformers (TAP, TAP†).

    Language:Python0200