pytorch/torchtitan

Wrong mesh order

ad8e opened this issue · 1 comments

ad8e commented

[self.dp, self.tp, self.pp], ["dp", "tp", "pp"], strict=True

Should be PP DP TP. This matters for NUMA across nodes.

good catch! submitting a fix soon