ChaosCodes/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
Python
Watchers
No one’s watching this repository yet.
Fast inference from large lauguage models via speculative decoding
Python
No one’s watching this repository yet.