SparkJiao/llama-pipeline-parallel
A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you have encoured.
Python
Stargazers
- 11zhouxuan
- akk-123
- benson-guo
- BeyonderXXFudan University
- bjoernpl@ellamind @DiscoResearch
- Dench991228
- DerrickWang005The University of Sydney
- enze5088Chinese Academy of Sciences
- fly51flyPRIS
- fmh1art
- fwyc0573
- grimulkan
- Guhaifudeng
- haozhx23
- HuangLKsysu
- JaheimLee
- JeffCarpenterCanada
- jiequancuiNTU
- LAKan233
- luckyyangrun
- nschlemmcoder nostra GmbH
- nth2000
- puppet101
- qcwthu
- qmpzqmpzSamsung Electronics Co., Ltd.
- Saigut
- SamaelChen
- SandalotsVolcanak
- ShuaipengWu
- SinclairCoderChina
- Songjw133
- SparkJiaoNTU-NLP & I2R, A*STAR, Singapore
- TabrisreiChina
- XianzheMaZürich
- xuezc
- ZurichRainSoochow University