Pinned Repositories
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
panyuyang's Repositories
panyuyang/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2