Where did you train the model?
Closed this issue · 1 comments
filopedraz commented
Did you use Deepspeed to train the model?
hunterhector commented
The backend is simply FSDP, https://github.com/LLM360/amber-train/blob/main/main.py#L129
Closed this issue · 1 comments
Did you use Deepspeed to train the model?
The backend is simply FSDP, https://github.com/LLM360/amber-train/blob/main/main.py#L129