/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT

Primary LanguagePythonOtherNOASSERTION

Watchers

No one’s watching this repository yet.