ChaosCodes/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
Python
Stargazers
No one’s star this repository yet.
Fast inference from large lauguage models via speculative decoding
Python
No one’s star this repository yet.