There seems to be a bug at datasets/bases.py#L157.

Question

There seems to be a bug at datasets/bases.py#L157.

Opened this issue a year ago · 1 comments

At datasets/bases.py#L157, you directly pass caption_tokens to the function _build_random_masked_tokens_and_labels and caption_tokens has be modified in this function. Thus, the masked captions are also used in the sdm task and id task, which is inconsistent with the clarification in the paper.

Answer 1 · 2023-05-27T00:11:16.000Z

Please see the reponse at #13