ChenRocks/UNITER

Image Text Retrieval Loss Discussion

jipson7 opened this issue · 0 comments

Hi.

Why did you decide to use a BCE loss on the ITM pretraining task and a ranking loss on the ITM downstream task? Is there any intuition behind this? Why not use a ranking loss on both?
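
To make the comparison concrete, here is a minimal sketch of the two losses as I understand them. It is plain Python, not the repository's code: the function names `bce_loss` and `margin_ranking_loss`, the margin of 0.2, and the example scores are all assumptions for illustration.

```python
import math


def bce_loss(logit, label):
    """Binary cross-entropy for a single image-text pair.

    logit: raw match score for the pair (hypothetical model output).
    label: 1 if the image and text match, 0 otherwise.
    """
    # softplus(x) = log(1 + exp(x)), computed in a numerically stable way
    def softplus(x):
        return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

    # -log(sigmoid(z)) == softplus(-z); -log(1 - sigmoid(z)) == softplus(z)
    return label * softplus(-logit) + (1 - label) * softplus(logit)


def margin_ranking_loss(pos_score, neg_scores, margin=0.2):
    """Hinge-style ranking loss: the matched pair should outscore each
    mismatched pair by at least `margin` (0.2 is an assumed value)."""
    losses = [max(0.0, margin + neg - pos_score) for neg in neg_scores]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    # BCE judges each pair on its own against an absolute label.
    print(bce_loss(2.0, 1), bce_loss(2.0, 0))
    # The ranking loss only cares about the gap between positive and negatives.
    print(margin_ranking_loss(0.9, [0.1, 0.2]))
    print(margin_ranking_loss(0.5, [0.6, 0.1]))
```

In this sketch BCE scores every pair independently, while the ranking loss is zero as soon as the positive pair beats every negative by the margin, regardless of the absolute scores.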

Thanks for the great work.

-Caleb