musram/LLM-Pretraining-Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT and GPT-2
Language: Python · License: NOASSERTION