/Finetune_llama2_Megatron

Using megatron style to do TP training.

Primary LanguagePython

Watchers