/Megatron-DeepSpeed-JP-ABCI

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Primary LanguagePythonOtherNOASSERTION

Stargazers

No one’s star this repository yet.