Could you please provide the checkpoints of MRQA datasets?
Closed this issue · 6 comments
Hi, I have included the available checkpoints, although not all of them are saved.
Thank for your reply. I am having some difficulties reproducing the MRQA dataset. Does these experiments require prompt initialization from other datasets?
No, there is no need to do any transfer learning for initialization. From my experience, just training longer typically helps.
Thanks for your question. This is only for the few-shot learning experiments.
In general, transfer learning can improve the speed of the convergence and the model performance. However, it might take a while or require some tricks to select the best source tasks. Therefore, in our experiments, when there are enough training examples, we typically train the soft prompt from the random initialization. In these cases, we find that training longer typically leads to better performance.
Does training require longer for other datasets? Could you please provide the checkpoints of the other datasets?