meshidenn/Megatron-DeepSpeed-JP-ABCI
Ongoing research training transformer language models at scale, including: BERT & GPT-2
PythonNOASSERTION
Stargazers
No one’s star this repository yet.
Ongoing research training transformer language models at scale, including: BERT & GPT-2
PythonNOASSERTION
No one’s star this repository yet.