microsoft/X-Decoder

Sampling strategy


Is the sampling strategy exactly the same as UniCL? Could you explain this more specifically? Thanks.

To train X-Decoder, we sample 32 COCO images and 1024 image-text pairs. Please let us know if you need any more details : )
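The reply gives only the two counts. It does not say whether they apply per iteration or per GPU, or how the samples are drawn. As a minimal sketch, assuming both counts apply to a single training step and that each source is sampled uniformly without replacement, the mixing could look like the following. The function name `sample_mixed_batch` and its parameters are hypothetical and are not taken from the X-Decoder codebase.

```python
import random


def sample_mixed_batch(coco_items, pair_items, n_coco=32, n_pairs=1024, seed=None):
    """Draw one mixed batch from two data sources.

    Assumption: the counts 32 and 1024 apply to one training step, and
    sampling is uniform without replacement. The thread confirms neither.
    """
    rng = random.Random(seed)
    # random.Random.sample draws unique elements and raises ValueError
    # if a pool is smaller than the requested count.
    coco_batch = rng.sample(coco_items, n_coco)
    pair_batch = rng.sample(pair_items, n_pairs)
    return coco_batch, pair_batch


if __name__ == "__main__":
    # Toy index pools standing in for the two datasets.
    coco_pool = list(range(10_000))
    pair_pool = list(range(100_000))
    coco_batch, pair_batch = sample_mixed_batch(coco_pool, pair_pool, seed=0)
    print(len(coco_batch), len(pair_batch))
```

Keeping the two draws separate fixes the ratio between the sources in every step, rather than leaving it to chance as a single draw from the pooled data would.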