This is the dataset for the paper Script-to-Storyboard: A New Contextual Retrieval Dataset and Benchmark
- You need first request access to the dataset from Large Scale Movie Description Challenge (LSMDC)
- Download the video and annotation files using the following steps:
- Download the video downloading scripts to
LSMDC
folder - Switch to
LSMDC
folder usingcd LSMDC
- run with
bash downloadChallengeData.sh <username-MPIIMD> <password-MPIIMD>
where<username-MPIIMD>
and<password-MPIIMD>
are the username and password you used to request access to the dataset
- Download the video downloading scripts to
- Run
python main.py
to extract the frames from movie clips
Download the Sc2St dataset in Google Drive The dataset structure is as follows:
Sc2St
├── story10_all
│ ├── story10_all.json
│ ├── story10_all_val.json
│ ├── story10_all_test.json
│ ├── story10_all_train.json
│ ├── story10_all_trainval.json
│ ├── story10_all_train_no.json
│ ├── ...
│ ├── story10_all_val_no.json
├── i2chars.json
│── parsed_clip.json
│── parsed_text.json
The story10_all
indicates the dataset is for storyboard with 10 images in a story
, where all
means it uses all the movies.
The .json
files contains id
that is a unique identifier for each movie clip frame in the format l_1_2_1
, which l
means LSMDC, and the three numbers are movie_id
, clip_id
, and frame_id
respectively.
The parsing code can be found in lsmdc_utils.py
.
If the id has only 2 digits, it represents the movie_id
and clip_id
respectively, for example, l_1_2
means the movie with id 1
and the clip with id 2
.
The json file ending with train
or val
or test
indicates the data split for training, validation, and testing respectively.
The json file ending with _no
indicates there is no overlapping of used frames across splits train
, val
, and test
.
These raw information are used in building the dataset, provided here for reference.
i2chars.json
contains the mapping from theclip id
to characters in that frame.parsed_clip.json
contains the mapping ofclip id
to its parsed keyframes indices.parsed_text.json
contains the mapping ofclip id
to its parsed text. A clip may have multiple text descriptions.
data
folder contains intermediate prepared data for LSMDC dataset.lsmdc.json
is the indexed LSMDC dataset frame directory structure, containing id for all movies, clips, and frames. Use this file together withlsmdc_utils.py
to parse the id.meta.csv
is the metadata for LSMDC dataset, containing information including movie genres.
lsmdc_utils.py
contains the parsing code for LSMDC dataset.main.py
contains the code for extracting frames from movie clips.utils.py
contains utility functions.