【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Primary language: Python. License: MIT.