video-language-understanding
There are 14 repositories under video-language-understanding topic.
whwu95/Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
whwu95/BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
YangLing0818/EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
doc-doc/NExT-GQA
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
liveseongho/Awesome-Video-Language-Understanding
A Survey on video and language understanding.
sail-sg/VGT
Video Graph Transformer for Video Question Answering (ECCV'22)
houzhijian/CONQUER
[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval
MikeWangWZHL/Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
houzhijian/CONE
[2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
zinengtang/DeCEMBERT
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
doc-doc/CoVGT
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
houzhijian/GroundNLQ
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
Maddy12/SSL4VideoSurvey
The official GitHub page for the survey paper "Self-Supervised learning for Videos: A survey"
jena-shreyas/Awesome-Video-Language-Resources
A repository of Video Language papers, code and datasets.