Zeyu-ZEYU/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Jupyter Notebook · NOASSERTION