(PRCV'2022) CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter
Primary LanguageJupyter Notebook