Ze-Yang/Context-Transformer

Issue about the COCO dataset

Closed this issue · 7 comments

Hi, thanks for your work.
In 'split_coco_dataset_voc_nonvoc.py', you split 3 annotation files, but I can't find 'instances_valminusminival2014.json' and 'instances_minival2014.json' in the official website of COCO. Could you tell me where I can get these files?

Hi, thanks for your work.
In 'split_coco_dataset_voc_nonvoc.py', you split 3 annotation files, but I can't find 'instances_valminusminival2014.json' and 'instances_minival2014.json' in the official website of COCO. Could you tell me where I can get these files?

You can download from this link.

Thanks for your reply!
I have another question about the prior boxes. In the original SSD, the output of its model includes prior box layers, which contain the coordinate information of all default boxes. In your code, however, these coordinates are computed out of the model. Instead, 'obj' is computed in the model, which is the output of a convolutional layer. What does this 'obj' part stand for?
WK((S)T$UPA)7 IFS4AH8H5

FYI, the prior box is the same as default boxes in SSD, as stated in Sec 4.1 in the our paper. Such boxes are predefined and don't require any update during training. Therefore, it's more efficient to compute it once out of the model, which is also applied in RFBNet.

As for the 'obj', it's a foreground/background classifier, somewhat like the objectness score (RPN output) in two-stage detector Faster RCNN. For details, please refer to Sec 3 of our paper. Thanks.

If I didn't get it wrong, this 'obj' part is the background classifier(BG) in your paper.

Exactly.

I see. Thanks for your help!