Image Text Retrieval Loss Discussion
jipson7 opened this issue · 0 comments
Hi.
Why did you decide to use a BCE loss on the ITM pretraining task and a ranking loss on the ITM downstream task? Is there any intuition behind this? Why not use a ranking loss for both?
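To make the question concrete, here is a minimal sketch of the two objectives being compared. It is not the repository's actual code: the function names, the per-example scalar inputs, and the margin of 0.2 are all assumptions chosen for illustration.

```python
import math


def bce_itm_loss(logit, label):
    """Binary cross-entropy on one image-text match logit.

    `label` is 1 for a matched pair and 0 for a mismatched pair.
    Hypothetical helper, not taken from the repository.
    """
    # Numerically stable form of
    # -[y * log(sigmoid(x)) + (1 - y) * log(1 - sigmoid(x))]
    return max(logit, 0.0) - logit * label + math.log1p(math.exp(-abs(logit)))


def ranking_itm_loss(pos_score, neg_scores, margin=0.2):
    """Hinge-style ranking loss averaged over the negatives.

    The matched pair's score should exceed each negative's score by at
    least `margin` (0.2 is an assumed value). Hypothetical helper.
    """
    return sum(max(0.0, margin - pos_score + s) for s in neg_scores) / len(neg_scores)


# BCE scores each pair independently against a 0/1 label.
print(bce_itm_loss(0.0, 1))

# The ranking loss only looks at the gap between a positive and its negatives.
print(ranking_itm_loss(0.9, [0.1, 0.8]))
```

The BCE term treats every pair as an independent classification, while the ranking term only penalizes negatives that come within the margin of the positive, which is the difference the question is about.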
Thanks for the great work.
-Caleb