PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
Primary LanguageJupyter NotebookMIT LicenseMIT