Question about schedule details for different models
NinedayWang opened this issue · 2 comments
NinedayWang commented
How are the learning rate and warmup set for the different models (the base/large variants of bert, roberta, and macbert)?
stale commented
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
stale commented
Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.