should nproc-per-node equals to sequence_parallel_size?
Edwardmark opened this issue · 3 comments
Edwardmark commented
Great work. I have a question, should nproc-per-node equals to sequence_parallel_size when do open-sora inference using DSP?
Thanks in advance.
oahzxl commented
sp size should be as small as possible
Edwardmark commented
@oahzxl why sp size should be as small as possible?
oahzxl commented
because the essense of sequence parallel is to accomodate long sequence at the cost of extra communication cost. so if memory is enough, keep it low