kohjingyu/fromage

Can I use the embeddings for training?

Closed · 2 comments

Hello, thank you for your work. Can I use cc3m_embeddings.pkl directly for model training, or do I need to download the CC3M dataset? Looking forward to your reply.

To train the model, you would most likely need the CC3M dataset itself. The embeddings are mainly used as retrieval candidates during inference.
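To illustrate what "retrieval candidates" means here, below is a minimal sketch of ranking precomputed embeddings against a query embedding by cosine similarity. It is not the repository's actual code: the dict layout with "paths" and "embeddings" keys, the helper names normalize and retrieve_top_k, and the toy 2-D vectors are all assumptions made for illustration, and the real file may be structured differently.

```python
import math
import pickle


def normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def retrieve_top_k(query, candidates, k=1):
    """Return the k candidate paths whose embeddings are most similar to the query.

    `candidates` is assumed (hypothetically) to be a dict with two parallel
    lists: "paths" and "embeddings".
    """
    q = normalize(query)
    scored = []
    for path, emb in zip(candidates["paths"], candidates["embeddings"]):
        e = normalize(emb)
        cosine = sum(a * b for a, b in zip(q, e))
        scored.append((cosine, path))
    scored.sort(reverse=True)  # highest cosine similarity first
    return [path for _, path in scored[:k]]


# Toy stand-in for a pickled embedding file (hypothetical layout and values).
toy = {
    "paths": ["img_a.jpg", "img_b.jpg", "img_c.jpg"],
    "embeddings": [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]],
}
candidates = pickle.loads(pickle.dumps(toy))

# In a real pipeline the query embedding would come from the model at inference.
top = retrieve_top_k([0.9, 0.1], candidates, k=2)
print(top)
```

Note that the file in this sketch holds only embedding vectors and identifiers, not image-caption pairs, which is consistent with the point above that it serves inference-time retrieval rather than training.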

Thank you!