SkandaBhat/swarm
Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"
Python
No issues in this repository yet.
Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"
Python
No issues in this repository yet.