video-description
There are 12 repositories under video-description topic.
jssprz/video_captioning_datasets
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
willyfh/awesome-video-text-datasets
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
jssprz/visual_syntactic_embedding_video_captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
dialogtekgeek/AVSD-DSTC10_Official
Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
jssprz/attentive_specialized_network_video_captioning
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
AmrHendy/video-content-description
Video content description technique for generating descriptions for unconstrained videos.
OwenEdwards/videojs-speak-descriptions-track
A Video.js 7 middleware that uses browser speech synthesis to speak descriptions contained in a description text track
AmrHendy/multimedia_question_answering
A simple attention deep learning model to answer questions about a given video with the most relevant video intervals as answers.
willyfh/msvd-indonesian
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
Ayeshaaaaaaaaa/Video-Description-and-Summarization-Using-BLIP-and-BART-Models
This project processes videos by extracting frames, generating detailed visual descriptions for each frame using the BLIP model, and then summarizing these descriptions with the BART model.
crim-ca/FrVD
FrVD: French Video Description dataset
crim-ca/FrVD-visualization-tool
Tool employed to visualize synchronized FrVD metadata and videos simultaneously.