video-retrieval

There are 56 repositories under the video-retrieval topic.

  • OpenGVLab/InternVideo

    [ECCV2024] Video Foundation Models & Data for Multimodal Understanding

    Language:Python1.4k2819388
  • jayleicn/ClipBERT

    [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

    Language:Python70695986
  • Vision-CAIR/MiniGPT4-video

    Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

    Language:Python559124260
  • albanie/collaborative-experts

    Video embeddings for retrieval with natural language queries

    Language:Python336102955
  • X-PLUG/Youku-mPLUG

    Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

    Language:Python28553011
  • jayleicn/moment_detr

    [NeurIPS 2021] Moment-DETR code and QVHighlights dataset

    Language:Python271105945
  • X-PLUG/mPLUG-2

    mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

    Language:Python22052518
  • MKLab-ITI/visil

    Authors' official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]

    Language:Python208102238
  • wjun0830/QD-DETR

    Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

    Language:Python20744616
  • jayleicn/TVRetrieval

    [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

    Language:Python15381324
  • tsujuifu/pytorch_violet

    A PyTorch implementation of VIOLET

    Language:Python1379176
  • jpthu17/EMCL

    [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

    Language:Python121349
  • jpthu17/DiffusionRet

    [ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

    Language:Python1203106
  • MKLab-ITI/ndvr-dml

    Authors' official TensorFlow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]

    Language:Python11861618
  • jpthu17/HBI

    [CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

    Language:Python109475
  • j-min/HiREST

    Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)

    Language:Python955139
  • foolwood/DRL

    [arXiv22] Disentangled Representation Learning for Text-Video Retrieval

    Language:Python93405
  • mever-team/distill-and-select

    Authors' official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 2022]

    Language:Python6510119
  • transvcl/TransVCL

    TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI 2023 Oral]

    Language:Python54356
  • jpthu17/DiCoSA

    [IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

    Language:Python49292
  • zchoi/PKOL

    [TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”

    Language:Python46210
  • Sy-Zhang/MMC-PCFG

    Video-aided Unsupervised Grammar Induction, NAACL'21 [best long paper]

    Language:Python40214
  • 4ML-platform/ndvr

    Near Duplicate Video Retrieval

    Language:Python39212
  • tsujuifu/pytorch_empirical-mvm

    A PyTorch implementation of EmpiricalMVM

    Language:Python39292
  • TXH-mercury/COSA

    Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

    Language:Python39243
  • gkordo/s2vs

    Authors' official PyTorch implementation of the "Self-Supervised Video Similarity Learning" [CVPRW 2023]

    Language:Python38242
  • martinetoering/ViCC

    [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.

    Language:Python37248
  • mlvlab/MELTR

    MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)

    Language:Python32756
  • willyfh/awesome-video-text-datasets

    A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.

  • li-xirong/w2vvpp

    W2VV++: A fully deep learning solution for ad-hoc video search

    Language:Python288015
  • xwen99/temporal_context_aggregation

    Temporal Context Aggregation for Video Retrieval with Contrastive Learning, WACV 2021

    Language:Python27373
  • callsys/TextVR

    [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension

    Language:Python22230
  • Arun-George-Zachariah/awesome-video-retrieval-papers

    List of resources for video retrieval.

    Language:TeX17301
  • Adamouization/Content-Based-Video-Retrieval-Code

    Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.

    Language:Python16505
  • gimpong/WWW22-HCQ

    The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).

    Language:Python16224
  • danielchyeh/this-is-my

    Official This-Is-My Dataset published in CVPR 2023

    Language:Python15252