/VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Primary LanguagePythonMIT LicenseMIT

Issues