activitynet-captions
There are 10 repositories under the activitynet-captions topic.
v-iashin/BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
ttengwang/PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
jayleicn/recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
v-iashin/MDVC
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
WuJie1010/Awesome-Temporally-Language-Grounding
A curated list of “Temporally Language Grounding” and related areas
jssprz/video_captioning_datasets
Summary of video-to-text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
WuJie1010/Temporally-language-grounding
A PyTorch implementation of some state-of-the-art models for "Temporally Language Grounding in Untrimmed Videos"
ttengwang/dense-video-captioning-pytorch
Second-place solution to the dense video captioning task in the ActivityNet Challenge (CVPR 2020 workshop)
LuoweiZhou/densecap
Dense video captioning in PyTorch
jssprz/video_features_extractor
Python implementation for extracting several visual feature representations from videos