/Megatron-DeepSpeed

Intel Gaudi's Megatron DeepSpeed Large Language Models for training

Primary LanguagePythonOtherNOASSERTION

Watchers