singhranjodh/keras-LAMB-Optimizer
Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
PythonMIT
No issues in this repository yet.
Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
PythonMIT
No issues in this repository yet.