/out-of-the-box-fp8-training

Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.

Primary language: Jupyter Notebook. License: MIT.
