galeselee/Awesome_LLM_System-PaperList
Since the emergence of ChatGPT in 2022, accelerating large language models (LLMs) has become increasingly important. This is a list of papers on accelerating LLMs. It currently focuses mainly on inference acceleration, and related work will be added gradually. Contributions are welcome!
Stargazers
- andakai, Beijing
- ATCP, Institute of Computing Technology
- chenhongyu2048
- chosen-ox, Shenzhen CN
- damionfan, UCAS
- duanzhaol, TJU
- durant1999
- Eunice710
- fake0fan, Hong Kong
- Forestree
- Fragile-azalea
- iku-iku-iku, University of Electronics and Technology of China
- jeferay, Peking University
- Joeyzhouqihui
- Lec16sf
- luqiang6q
- lzzmm, HKUST(Guangzhou)
- nasimshm, University of Texas at Arlington
- palhotellikeada
- Pang-GJ, Harbin Institute of Technology
- pku0xff, Peking University
- pprp, Data Science and Analytic Thrust, Information Hub, HKUST(GZ)
- renwuli
- sunxt99, University of Chinese Academy of Sciences
- tensorboy, TikTok Inc
- vangohao, Peking University
- victbr
- wabluy, Hong Kong
- wangyuyue, Los Angeles
- xiurui-pan, Tsinghua University
- xvbolai, China University of Petroleum, Beijing
- xxyux, Data Science and Analytic Thrust, Information Hub, HKUST(GZ)
- YukeWang96, University of California, Santa Barbara
- zcgit001, Huazhong University of Science and Technology
- zss19930910
- zyuxlove