video-description

There are 12 repositories under the video-description topic.

jssprz/video_captioning_datasets
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Language: Jupyter Notebook 117 3 112
willyfh/awesome-video-text-datasets
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
33 2 03
jssprz/visual_syntactic_embedding_video_captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
Language: Python 30 2 128
dialogtekgeek/AVSD-DSTC10_Official
Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
27 6 02
jssprz/attentive_specialized_network_video_captioning
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
Language: Python 15 2 123
AmrHendy/video-content-description
Video content description technique for generating descriptions for unconstrained videos.
Language: Jupyter Notebook 9 2 02
OwenEdwards/videojs-speak-descriptions-track
A Video.js 7 middleware that uses browser speech synthesis to speak descriptions contained in a description text track
Language: JavaScript 6 2 31
AmrHendy/multimedia_question_answering
A simple attention deep learning model to answer questions about a given video with the most relevant video intervals as answers.
Language: Python 2 3 04
willyfh/msvd-indonesian
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
2 1 00
Ayeshaaaaaaaaa/Video-Description-and-Summarization-Using-BLIP-and-BART-Models
This project processes videos by extracting frames, generating detailed visual descriptions for each frame using the BLIP model, and then summarizing these descriptions with the BART model.
Language: Jupyter Notebook 1 0
crim-ca/FrVD
FrVD: French Video Description dataset
6 0
crim-ca/FrVD-visualization-tool
Tool employed to visualize synchronized FrVD metadata and videos simultaneously.
Language: Python 4 0
