/ClipClap-GZSL

Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models

Primary LanguagePythonMIT LicenseMIT