video-text-retrieval
There are 16 repositories under the video-text-retrieval topic.
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Paranioar/Awesome_Matching_Pretraining_Transfering
A paper list covering large multi-modality models, parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
microsoft/UniVL
An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
whwu95/Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
salesforce/ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
m-bain/CondensedMovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
xuguohai/X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
alipay/Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
amazon-science/crossmodal-contrastive-learning
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
LeapLabTHU/Cross-Modal-Adapter
[arXiv] Cross-Modal Adapter for Text-Video Retrieval
RenShuhuai-Andy/TESTA
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
shufangxun/MAC
An end-to-end masked contrastive video-and-language pre-training framework
unitaryai/VTC
VTC: Improving Video-Text Retrieval with User Comments
rn-snehapriya/Automatic-Note-Taking-From-Video-Using-Tesseract-OCR
Text from the video is extracted and saved into a .docx file in the form of notes.
Jazz1996/tech_review
Survey of state-of-the-art video-text retrieval methods.