/gradient-rounding

Round the gradient during LLM training to different degrees; compare "scaling" of rounding to different significant digits to parameter scaling

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers

No one’s star this repository yet.