Cross-modal retrieval on COCO

Question

Cross-modal retrieval on COCO

katerynaCh opened this issue 2 years ago · 2 comments

Hi! Thank you for your work! I have a question regarding cross-modal retrieval on COCO, I am struggling to reproduce the results reported in the paper. Could you provide more details on your training protocol? Which coefficients are you using for VICReg/what kind of expander architecture/are you doing any further downstream training or do you directly use the encoder embeddings obtained via ssl?

Thank you!

Answer 1 · 2023-01-11T09:45:08.000Z

Hi,
I am facing the same issue to reproduce the results of the paper. Did you find the right configuration?

Answer 2 · 2023-06-01T15:00:03.000Z

I am facing the same issue, too!