/Megatron-LM_HBFP

ColTraIn's fork of the Megatron-LM project with Hybrid Block Floating-Point (HBFP) training capability

Primary LanguagePythonOtherNOASSERTION

Stargazers