[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
Primary LanguagePythonMIT LicenseMIT