Fine-tuning the hyperparameters

Question

Closed this issue 5 years ago · 2 comments

ayush2051 commented 5 years ago

Hi Tom,
Did you fine-tune any of the hyperparameters that you shared in the FLAGS file? I tried with the same values and got a BLEU score of 14.17 and an NLL of 40.33, with the GLOVE vocab and PG set to FALSE. Could you let me know the exact split of the dev and test set samples that you used for the evaluation? This high score may be biased by the different split I took.

Answer 1 · 2019-07-26T08:18:42.000Z

Hi, I'd strongly recommend using the split from here, as it has become the standard split in the literature:
https://github.com/tomhosking/squad-du-split

I didn't do a full hyperparameter optimisation, but did play around with them a bit during development.

Answer 2 · 2019-07-27T18:58:05.000Z

Thanks a lot, Tom! I will try a few combinations on top of that.