About the number of OCR in stvqa dataset

Question

About the number of OCR in stvqa dataset

JayZhu0104 opened this issue 3 years ago · 1 comments

Hi！
I found that the number of words detected by OCR in some pictures in stvqa dataset is inconsistent with the corresponding feature number.
For example, the number of features in 'feat_resx/stvqa/train/imageNet/n03196217_ 7957. npy' is 33, while the number of OCR words in the corresponding 'ocr_ feat_ resx/stvqa_ conf/train/imageNet/n03196217_ 7957_info. npy' is 55. The two numbers do not match. About 2000 pictures have this problem in train dataset.

Answer 1 · 2021-10-07T19:59:57.000Z

Updated the corresponded files :)