microsoft/X-Decoder

Question about box prediction

Opened this issue · 0 comments

In this line (below), it seems that the code uses masks to predict boxes,

result.pred_boxes = BitMasks(mask_pred > 0).get_bounding_boxes()

but in line 913, the predicted boxes are already obtained.

box_pred = box_pred[keep]

Why do not use predicted bboxes directly?