krinshi's Stars
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Weixin-Liang/MetaShift
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)
JingweiJ/ActionGenome
A video database bridging human actions and human-object relationships
gunthercox/chatterbot-corpus
A multilingual dialog corpus
ttengwang/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
antoine77340/Mixture-of-Embedding-Experts
Mixture-of-Embeddings-Experts
jcjohnson/densecap
Dense image captioning in Torch
ttengwang/dense-video-captioning-pytorch
Second-place solution to the dense video captioning task in the ActivityNet Challenge (CVPR 2020 workshop)
ttengwang/PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
JaywongWang/DenseVideoCaptioning
Official TensorFlow implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" (CVPR 2018), with code, model, and prediction results.
xiadingZ/video-caption.pytorch
PyTorch implementation of video captioning