Pinned Repositories
latr
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)
mxw20010804's Repositories
mxw20010804 doesn’t have any repository yet.
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)
mxw20010804 doesn’t have any repository yet.