Regarding the data of the VIST Dataset

Question

Regarding the data of the VIST Dataset

Opened this issue 2 years ago · 1 comments

Hi @xichenpan
When I tried to reproduce the experiment on the VIST dataset, I noticed that there are numerous duplicate story images in the testing set as illustrated in the figure below, although their text descriptions differ. Is this because some image URLs were inaccessible during the download process? I utilized the vist_img_download.py script to download a total of 184011 images, but I am unsure if some images may have been missing. Would it be possible for you to share the vist.h5 file through Google Drive?

Answer 1 · 2023-03-13T07:57:19.000Z

Hi @SaulZhang , sorry for the delayed reply, I was busy with ICCV last week. The VIST dataset does contain duplicate images. It is due to for a same visual stories, there are multiple captions, so your downloaded images are actually correct.
Unfortunately, I do not have a vist.h5 file, because we use nas table in Alibaba for data storage, and these h5 scripts are only written for users to accelerate IO. Hope this can help!