video-retrieval

There are 56 repositories under video-retrieval topic.

OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language:Python1.4k 28 19388
jayleicn/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Language:Python706 9 5986
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Language:Python559 12 4260
albanie/collaborative-experts
Video embeddings for retrieval with natural language queries
Language:Python336 10 2955
X-PLUG/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Language:Python285 5 3011
jayleicn/moment_detr
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
Language:Python271 10 5945
X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Language:Python220 5 2518
MKLab-ITI/visil
Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]
Language:Python208 10 2238
wjun0830/QD-DETR
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Language:Python207 4 4616
jayleicn/TVRetrieval
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Language:Python153 8 1324
tsujuifu/pytorch_violet
A PyTorch implementation of VIOLET
Language:Python137 9 176
jpthu17/EMCL
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Language:Python121 3 49
jpthu17/DiffusionRet
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Language:Python120 3 106
MKLab-ITI/ndvr-dml
Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]
Language:Python118 6 1618
jpthu17/HBI
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
Language:Python109 4 75
j-min/HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
Language:Python95 5 139
foolwood/DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
Language:Python93 4 05
mever-team/distill-and-select
Authors official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 2022]
Language:Python65 10 119
transvcl/TransVCL
TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]
Language:Python54 3 56
jpthu17/DiCoSA
[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
Language:Python49 2 92
zchoi/PKOL
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
Language:Python46 2 10
Sy-Zhang/MMC-PCFG
Video-aided Unsupervised Grammar Induction, NAACL‘21 [best long paper]
Language:Python40 2 14
4ML-platform/ndvr
Near Duplicate Video Retrieval
Language:Python39 2 12
tsujuifu/pytorch_empirical-mvm
A PyTorch implementation of EmpiricalMVM
Language:Python39 2 92
TXH-mercury/COSA
Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Language:Python39 2 43
gkordo/s2vs
Authors official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]
Language:Python38 2 42
martinetoering/ViCC
[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Language:Python37 2 48
mlvlab/MELTR
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
Language:Python32 7 56
willyfh/awesome-video-text-datasets
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
31 2 03
li-xirong/w2vvpp
W2VV++: A fully deep learning solution for ad-hoc video search
Language:Python28 8 015
xwen99/temporal_context_aggregation
Temporal Context Aggregation for Video Retrieval with Contrastive Learning, WACV 2021
Language:Python27 3 73
callsys/TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Language:Python22 2 30
Arun-George-Zachariah/awesome-video-retrieval-papers
List of resources for video retrieval.
Language:TeX17 3 01
Adamouization/Content-Based-Video-Retrieval-Code
Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.
Language:Python16 5 05
gimpong/WWW22-HCQ
The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).
Language:Python16 2 24
danielchyeh/this-is-my
Official This-Is-My Dataset published in CVPR 2023
Language:Python15 2 52

video-retrieval

OpenGVLab/InternVideo

jayleicn/ClipBERT

Vision-CAIR/MiniGPT4-video

albanie/collaborative-experts

X-PLUG/Youku-mPLUG

jayleicn/moment_detr

X-PLUG/mPLUG-2

MKLab-ITI/visil

wjun0830/QD-DETR

jayleicn/TVRetrieval

tsujuifu/pytorch_violet

jpthu17/EMCL

jpthu17/DiffusionRet

MKLab-ITI/ndvr-dml

jpthu17/HBI

j-min/HiREST

foolwood/DRL

mever-team/distill-and-select

transvcl/TransVCL

jpthu17/DiCoSA

zchoi/PKOL

Sy-Zhang/MMC-PCFG

4ML-platform/ndvr

tsujuifu/pytorch_empirical-mvm

TXH-mercury/COSA

gkordo/s2vs

martinetoering/ViCC

mlvlab/MELTR

willyfh/awesome-video-text-datasets

li-xirong/w2vvpp

xwen99/temporal_context_aggregation

callsys/TextVR

Arun-George-Zachariah/awesome-video-retrieval-papers

Adamouization/Content-Based-Video-Retrieval-Code

gimpong/WWW22-HCQ

danielchyeh/this-is-my