/LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers