Zeyu-ZEYU/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Jupyter NotebookNOASSERTION
No issues in this repository yet.
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Jupyter NotebookNOASSERTION
No issues in this repository yet.