timvink opened this issue 5 years ago · 1 comments
Describe the Bug
The README example states:
RAdam(total_step=10000, warmup_proportion=0.1, min_lr=1e-5)
Should be total_steps:
-RAdam(total_step=10000, warmup_proportion=0.1, min_lr=1e-5) +RAdam(total_steps=10000, warmup_proportion=0.1, min_lr=1e-5)
PR in #14