My implementation with better results based on reasonable hyperparameters
waterhorse1 opened this issue · 0 comments
waterhorse1 commented
Dear authors,
I have reproduced the algorithm in the paper. In your original paper, you set the inner loop learning rate to 5e-5 and outer-loop learning rate to 5e-6, which from my perspective is too low to have a good learning process. So I reset the parameters and test the MAE of my implementation. And it turns out to be better than your results.
So I wonder whether the hyperparameters in your paper isn't proper or is there any other reasons to lower inner and outer learning rate to that extent.
My implementation is in:
https://github.com/waterhorse1/MELU_pytorch