/VLCAP

[ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Primary LanguageJupyter Notebook

Stargazers