format explanation for the grounding annotation file

Question

Closed this issue a year ago · 3 comments

Is there any explanation or instruction on how to understand the format of the annotation files, expression and labels_with_ids?
How are they linked to each other?

Answer 1 · 2023-04-25T07:33:41.000Z

Each expression corresponds to multiple frames, each frame containing multiple labels_with_ids.

Sorry for the confusion; we will update the details.

Answer 2 · 2023-04-25T16:51:47.000Z

Thanks for the reply!
I still have no idea where to start. For example, given an expression file such as black-cars-in-right.json, how can I find its corresponding images and the associated bboxes?

Answer 3 · 2023-06-26T05:45:20.000Z

When you open a JSON file, the name of its corresponding image sequence is the name of the folder that contains the file.
Our JSON files contain only the corresponding object ids, so the boxes for those ids have to be looked up in 'labels_with_ids'.
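
For anyone landing here later, the lookup described above can be sketched roughly as follows. This is only an illustration under assumptions that are not confirmed in this thread: that the expression JSON keeps a frame-to-ids mapping under a "label" key, that labels_with_ids holds one text file per frame named with a six-digit frame number, and that each line in it reads "class_id object_id x y w h". The function names are hypothetical, so adjust them and the layout to whatever the actual files contain.

```python
import json
from pathlib import Path


def load_frame_boxes(label_file):
    """Parse one per-frame label file into {object_id: (x, y, w, h)}.

    ASSUMPTION: each line is 'class_id object_id x y w h', whitespace separated.
    """
    boxes = {}
    for line in Path(label_file).read_text().splitlines():
        parts = line.split()
        if len(parts) < 6:
            continue  # skip empty or malformed lines
        object_id = int(float(parts[1]))
        boxes[object_id] = tuple(float(v) for v in parts[2:6])
    return boxes


def boxes_for_expression(expression_json, labels_root):
    """Return (sequence_name, {frame: {object_id: box}}) for one expression file."""
    expression_json = Path(expression_json)
    # Per the answer above: the folder name is the image sequence name.
    sequence = expression_json.parent.name
    data = json.loads(expression_json.read_text())
    # ASSUMPTION: frame -> list of object ids is stored under the "label" key.
    frame_to_ids = data["label"]

    result = {}
    for frame, ids in frame_to_ids.items():
        # ASSUMPTION: label files are named by six-digit frame number.
        label_file = Path(labels_root) / sequence / f"{int(frame):06d}.txt"
        if not label_file.exists():
            continue
        frame_boxes = load_frame_boxes(label_file)
        # Keep only the boxes whose ids the expression refers to.
        result[frame] = {
            int(i): frame_boxes[int(i)] for i in ids if int(i) in frame_boxes
        }
    return sequence, result
```

Calling boxes_for_expression("expression/0005/black-cars-in-right.json", "labels_with_ids") would then give the sequence name "0005" plus, per frame, the boxes of the referred objects; the path used here is only an example.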