/Megatron

Ongoing research training transformer models at scale

Primary language: Python

License: Other (NOASSERTION)