[CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Primary LanguagePython