parsa-epfl/Megatron-LM_HBFP
ColTraIn's fork of the Megatron-LM project with Hybrid Block Floating-Point (HBFP) training capability
Language: Python. License: NOASSERTION.