Issues
- 2
异构训练报NCCL错误
#201 opened by echo-valor - 0
从checkpoint中加载模型进行增量预训练
#200 opened by echo-valor - 0
[BUG or ENHANCEMENT] about SSH runner
#193 opened by shh2000 - 2
- 2
请问pp异构并行训练时,划分模型层列表是否只能通过手动设置?
#153 opened by JinXiaozhao - 2
[QUESTION] 支持在异构设备上进行训练吗?
#31 opened by slbqc - 1
llama2 70B模型在不同PP下loss下降趋势不同
#81 opened by ZLkanyo009 - 0
期望新增以下切割模型权重的功能
#68 opened by helen88 - 1
- 1
[QUESTION] Support other hardware?
#16 opened by SueeH - 3
Support Lora or other peft
#1 opened by 2033329616