This is an open-source repository for building and researching fusion-style deep learning methods that combine pretrained vision models with pretrained language models for the visual question answering (VQA) task in Vietnamese.
Primary Language: Python
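
To make "fusion-style" concrete, here is a minimal sketch of one common way to combine image and question features before predicting an answer. It is not taken from the repository: the function names, feature sizes, and answer vocabulary are all hypothetical, and random vectors stand in for the outputs of the pretrained vision and language encoders.

```python
# Illustrative sketch only: every name and size here is hypothetical,
# not the repository's actual API.
import random


def linear(vec, weights, bias):
    """Apply a dense layer: one output score per row of `weights`."""
    return [sum(w * x for w, x in zip(row, vec)) + b for row, b in zip(weights, bias)]


def fuse_concat(image_feat, question_feat):
    """Concatenation fusion: join the two feature vectors end to end."""
    return list(image_feat) + list(question_feat)


def fuse_product(image_feat, question_feat):
    """Element-wise product fusion: requires equal-length vectors."""
    if len(image_feat) != len(question_feat):
        raise ValueError("feature vectors must have the same length")
    return [a * b for a, b in zip(image_feat, question_feat)]


def predict_answer(image_feat, question_feat, weights, bias, answers):
    """Fuse the features, score each candidate answer, return the best one."""
    fused = fuse_concat(image_feat, question_feat)
    scores = linear(fused, weights, bias)
    best = max(range(len(scores)), key=scores.__getitem__)
    return answers[best]


rng = random.Random(0)
image_feat = [rng.uniform(-1, 1) for _ in range(4)]     # stand-in for vision encoder output
question_feat = [rng.uniform(-1, 1) for _ in range(4)]  # stand-in for language encoder output
answers = ["có", "không", "hai"]                        # hypothetical answer vocabulary
weights = [[rng.uniform(-1, 1) for _ in range(8)] for _ in answers]  # untrained classifier
bias = [0.0] * len(answers)

print(predict_answer(image_feat, question_feat, weights, bias, answers))
```

In a real model the classifier weights would be learned, and the fusion step could be swapped (for example, `fuse_product` instead of `fuse_concat`) without changing the rest of the pipeline.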