/keras-LAMB-Optimizer

Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.