Are the video titles and tags provided by the creators, or generated by some video-language models?
Issue closed · 1 comment
CleverPaul commented
I am curious how metadata such as titles and tags was obtained. Was it provided manually by the video creators, or generated automatically by video-text models?
yxni98 commented
Hi @CleverPaul, all videos and related data were created by human creators; no generated data is involved in the dataset. Please see our paper for the dataset construction process.