helloWaterM/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
PythonNOASSERTION
Stargazers
No one’s star this repository yet.
Ongoing research training transformer language models at scale, including: BERT & GPT-2
PythonNOASSERTION
No one’s star this repository yet.