henghuiding/Vision-Language-Transformer

Confusion about data_process_v2

Opened this issue · 0 comments

Hello, I just checked the file 'data/data_process_v2.py', and I found something confusing.

Since in line 98 you check 'if dataset == 'refclef', apparently, you take RefClef dataset into account, not only RefCoco, Refcoco+, Refcocog. But should categories in Refclef be processed the same way like Refcoco*, as in cat_process function? I guess the cat_process function is to convert COCO 91-category to 80-category. I wonder if this works to Refclef similarly?

By the way, still in line 98, why should ['19579.jpg', '17975.jpg', '19575.jpg'] be excluded? Is there any explanation?

Your reply would be highly appreciated, thanks :)