[TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
Primary language: Jupyter Notebook
License: Apache License 2.0 (Apache-2.0)