PKU-DAIR/Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).
Python
Issues
- 0
How to Use
#4 opened by tisgotos - 2
Error when train galvatron with global mode CUDA error: uncorrectable ECC error encountered
#3 opened by CannonWWW - 0
question about the config file
#2 opened by hailuoS - 0
How to use it?
#1 opened by robertLiuLinFeng