dilab-zju/self-speculative-decoding
Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
Jupyter NotebookApache-2.0
Stargazers
- BarfingLemurs
- caiyuhuShanghai University
- dawn2034Beijing China
- FABSXAbHangzhou, China
- felix-ky
- fishfishfishfishfish
- fly51flyPRIS
- gmftbyGMFTBYBeijing Institute of Technology
- HillZhang1999Bytedance
- hunter-lee1
- IamXuLiang
- imp0821Zhejiang University
- IvesCheung
- josecohenca
- KaiLv69
- KerfuffleV2
- lin72h
- LorrinWWWZhejiang University
- Ma-Yongqiang
- Nealcly
- ozyyshrIL, USA
- RahulSChand
- raymin0223KAIST AI
- Ryu1845
- sambroy
- SandalotsVolcanak
- stjordanisGreece
- tbEgg
- tricktreatZhejiang University
- we1kHangzhou, China
- wjie98
- xzymustbexzy
- YixinSong-eSJTU
- yiyihum
- yzhangcsSoochow University
- zui-jiangQingdao