HuangLK/transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
PythonApache-2.0
Stargazers
- AltenLiChina
- another1s
- bllddECNU
- darth-veitcherLondon
- ezioliao
- germanjkeHuawei
- HuangLKsysu
- hudengjunaiHangzhou,Zhejiang,China
- idootopMint
- jonataslawIris
- JY-Ren
- KingGoldXu
- kir152
- KylixC
- l294265421Tencent
- LiGhtime
- lumosity4tpjShenzhen
- macarthur99
- mars79668
- muou55555CloudWalk
- now-101
- puppet101
- SandalotsVolcanak
- SysuCharon
- taishiciR
- ufwt
- Ulov888Hang Zhou
- vitrun@Bytedance @Wacai @Duitang
- wang-zeruiShanghai AI Laboratory / Shanghai Jiao Tong University
- wangjiaqiys
- xiaj1011
- Xie-Minghui
- ykk648Nan Jing
- zhangsanfeng86
- zhhao1
- zydsBeijing