YangZhou08/LLMSpeculativeSamplingModifi
Fast inference from large language models via speculative decoding
Python