CenterCLIP

[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. This repository also serves as a text-video retrieval toolbox built on CLIP, with fast video decoding via PyAV.

Primary Language: Python. License: Other (NOASSERTION).
