/cosum

Data / annotations for video co-summarization (CVPR15)

#Preview

The 10 categories collected in this dataset. See discovery demo here.

#Descrptions

This Co-Sum dataset serves as a benchmark to validate video co-summarization techniques, where the goal is to create video summaries given a video collection of the same topic. The dataset is collected from YouTube using 10 queries, in total 51 videos of 147m40s. We release the video URLs, proprocessed shot indices and annotations for reproducibility of our results.

This dataset has been evaluated on two tasks:

  • Adaptive video summarization: Create summaries for each video adaptive to a query string
  • Concept visualization: Generate visual (video) concepts from a query string (eg, "surf" and "bike polo")

More info:

@inproceedings{chu2015video,
    title={Video co-summarization: Video summarization by visual co-occurrence},
    author={Chu, Wen-Sheng and Song, Yale and Jaimes, Alejandro},
    booktitle={CVPR},
    year={2015}
}

#Shot Indices

The shot indices used in the paper can be found in the shots/ directory. Note that the indices can be post-processed into smaller shots to avoid too lengthy shots (see Sec 3.1 in paper).

#Video URLs

01: Base jump

base1 base2 base3 base4 base4


02: Bike polo

bike1 bike2 bike3 bike4 bike5


03: Eiffel tower

eiffel1 eiffel2 eiffel3 eiffel4 eiffel5 eiffel6 eiffel7


04: Excavators river cross

exc1 exc2 exc3


05: Kids play in leaves

kid1 kid2 kid3 kid4 kid5 kid6


06: MLB

mlb1 mlb2 mlb3 mlb4 mlb5 mlb6


07: NFL

nfl1 nfl2 nfl3


08: Notre dame cathedral

notre1 notre2 notre3 notre4 notre5


09: Statue of liberty

statue1 statue2 statue3 statue4 statue5


10: Surf

surf1 surf2 surf3 surf4 surf5 surf6