Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
Primary LanguagePython