Can't reproduce the results for GLUE and hyperparameter misalignment

Question

Can't reproduce the results for GLUE and hyperparameter misalignment

nbasyl opened this issue a year ago · 4 comments

Hi,
Thanks for the great work.

I am trying to reproduce the result of Roberta-large on the NLU tasks, however, I got a CoLA score = 0 and MNLI = 31.3 using the provided finetuning scripts, and then I found out that there are misalignments between the hyperparameters in the provided training scripts and those on the paper. For example, in roberta_large_cola.sh the lr is set to 3e-4, but in the paper, it is set to 2e-4. Which settings should I follow to reproduce the reported result?

looking forward to your reply!

Best,
Sean

Answer 1 · 2023-11-22T09:26:07.000Z

I changed the lr in the CoLA training script to 2e-4 and solved the CoLA constant 0 eval correlation value problem, but still couldn't reproduce the MNLI result :(

Answer 2 · 2023-11-22T09:28:42.000Z

But I am still only getting 62.82 CoLA score, anyone encountered similar problem when trying to reproduce the result

Answer 3 · 2024-03-17T02:10:28.000Z

But I am still only getting 62.82 CoLA score, anyone encountered similar problem when trying to reproduce the result

Hi，Did you solve this problem?

Answer 4 · 2024-04-14T11:12:30.000Z

I changed the lr in the CoLA training script to 2e-4 and solved the CoLA constant 0 eval correlation value problem, but still couldn't reproduce the MNLI result :(

My result in CoLA is 63.48 which matches the paper. And the random seeds used are (1 3 13 37 71), but I can not reproduce other task, only CoLA can match the paper.