/self-speculative-decoding

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Stargazers

No one’s star this repository yet.