/Megatron-LM

Ongoing research training transformer models at scale

Primary Language: Python
License: Other (NOASSERTION)
