Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

Primary Language: Python
License: Other (NOASSERTION)
