attesaarela/Sophia
Copy of the official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
Python · MIT
Stargazers
No one has starred this repository yet.