ymcui/Chinese-BERT-wwm

求问不同模型的schedule细节

NinedayWang opened this issue · 2 comments

请问不同模型(bert、roberta、macbert的base/large)的学习率和warmup是怎么设置的呢

stale commented

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale commented

Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.