/LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Primary LanguagePython

Watchers

No one’s watching this repository yet.