Pinned Repositories
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
ShivamSharma2705's Repositories
ShivamSharma2705/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2