shabie/docformer

There is no SEP token appended

Closed this issue · 1 comments

You have a [SEP] or an equivalent token at the end which I think is not what the authors used:

token_boxes = [[0, 0, 0, 0]] + token_boxes + [[1000, 1000, 1000, 1000]]
unnormalized_token_boxes = [[0, 0, 0, 0]] + unnormalized_token_boxes + [[1000, 1000, 1000, 1000]]

See the first paragraph of the sub-section "Language Features" in the section 3.1

You can check it now, I have updated it, and for the bounding boxes as well, I removed the [1000,1000,1000,1000]