feifeibear/LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
Python, Apache-2.0
Stargazers
- AdrianJohnston
- baudzhou
- camphora
- caoshijie0501
- daralthus (Makethink)
- diggerdu (Bytedance AI Lab)
- fakerbaby (Fudan University)
- feifeibear (Tencent)
- fly51fly (PRIS)
- FrankLeeeee (Nanyang Technological University)
- frankxyy (Shanghai, China)
- glcolor
- Gy-Lu (@hpcaitech)
- hewr1993
- i7990X
- liminn (ShangHai)
- Nealcly
- NHZlX (Beijing)
- noobimp (University of Chinese Academy of Sciences)
- ryantd (@kwai)
- Sandalots (Volcanak)
- seshurajup (@dolcera)
- SeTriones (Beijing)
- Shenggan (National University of Singapore)
- Stick-To
- super-dainiu (@gersteinlab @albert-maxwell @Yale-CompBio)
- TianzhongSong (Shanghai)
- tnlin (Alibaba Tongyi)
- UranusSeven (Xprobe)
- varunshenoy
- Whiplashzeb
- wktzjz
- Xu-Kai (National University of Singapore)
- youxiho1 (Nankai University)
- Yuan-ManX (Shanghai, China)
- ZhangYunchenY