/ST-VQA_Loc

Multimodal grid features and cell pointers for Scene Text Visual Question Answering

Primary LanguagePython

Stargazers