tczhangzhi/pytorch-parallel
Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.
Python · MIT
Stargazers
- ackness (ShenZhen, China)
- arcsinx11
- back2yes
- baifanysu
- brycexu (Carnegie Mellon University)
- ccx1997
- chingswy
- chwbin
- DTennant (Shanghai)
- EmilyLike (shenzhen)
- eric-heiden (University of Southern California)
- flinger123
- FlyingAnt2018
- Frank00001 (xidian university)
- HustHB
- jjn037 (Horizon)
- kaiya (Jinan University)
- KANGRuipeng (BeiJing/HongKong/ShenZhen)
- kevinzbw
- kimsimple (Beijing)
- laomao0 (SJTU)
- Lihit
- lijiunderstand
- looput (Huazhong University of Science and Technology)
- pean1128 (Tencent Game)
- Pine-sha
- qiulesun
- shipeizhen
- the-butterfly (HangZhou, ZheJiang Province)
- vanpersie32
- WhatAShot
- xyl670988520
- yeyuling1990
- ys10
- zjshlyoyo
- ZJZz (China)