fartashf/vsepp

The number of COCO validation images

KevinLight831 opened this issue · 1 comment

In the COCO dataset, the number of validation images is 1000, but I found that many papers claim the number is 5000.

The original split of MSCOCO has 5000 test images and more than 5000 validation images; the code keeps only the first 5000 validation images. Please see Section 3.1 of the paper (https://arxiv.org/pdf/1707.05612.pdf) for the details and references, and the corresponding lines in the code:

vsepp/data.py

Lines 51 to 59 in 7f8ffd5

ids['train'] = np.load(os.path.join(capdir, 'coco_train_ids.npy'))
ids['val'] = np.load(os.path.join(capdir, 'coco_dev_ids.npy'))[:5000]
ids['test'] = np.load(os.path.join(capdir, 'coco_test_ids.npy'))
ids['trainrestval'] = (
    ids['train'],
    np.load(os.path.join(capdir, 'coco_restval_ids.npy')))
if use_restval:
    roots['train'] = roots['trainrestval']
    ids['train'] = ids['trainrestval']
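To make the effect of the `[:5000]` slice concrete, here is a minimal sketch that mirrors the logic of the excerpt using in-memory arrays instead of the `.npy` files. The function name `split_sizes`, the `val_limit` parameter, and the array lengths in the usage example are hypothetical placeholders chosen for illustration; they are not part of the repository and are not the real COCO counts.

```python
import numpy as np


def split_sizes(train_ids, dev_ids, test_ids, restval_ids,
                use_restval=False, val_limit=5000):
    """Return the number of ids per split, mirroring the excerpt's logic.

    Hypothetical helper for illustration only; not part of vsepp.
    """
    ids = {
        'train': train_ids,
        # Only the first `val_limit` dev ids are kept for validation.
        'val': dev_ids[:val_limit],
        'test': test_ids,
    }
    ids['trainrestval'] = (ids['train'], restval_ids)
    if use_restval:
        # The train entry becomes a tuple of (train, restval) id arrays.
        ids['train'] = ids['trainrestval']

    sizes = {}
    for split in ('train', 'val', 'test'):
        value = ids[split]
        if isinstance(value, tuple):
            sizes[split] = sum(len(part) for part in value)
        else:
            sizes[split] = len(value)
    return sizes


# Placeholder lengths, NOT the real COCO counts.
sizes = split_sizes(np.arange(100), np.arange(6000),
                    np.arange(5000), np.arange(50))
print(sizes)
```

With a dev id array longer than 5000, the validation split is capped at 5000 while the test split is left unchanged. Passing `use_restval=True` increases only the training count.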