Cross-modal retrieval on COCO
katerynaCh opened this issue · 2 comments
katerynaCh commented
Hi! Thank you for your work! I have a question regarding cross-modal retrieval on COCO, I am struggling to reproduce the results reported in the paper. Could you provide more details on your training protocol? Which coefficients are you using for VICReg/what kind of expander architecture/are you doing any further downstream training or do you directly use the encoder embeddings obtained via ssl?
Thank you!
firasl commented
Hi,
I am facing the same issue to reproduce the results of the paper. Did you find the right configuration?
MauritsBleeker commented
I am facing the same issue, too!