Visual Question Answering using Transformer and Bottom-Up attention. Implemented in Pytorch
Primary LanguagePythonMIT LicenseMIT