/Speculative-Sampling

Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by DeepMind.
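For orientation, here is a minimal sketch of the accept/reject rule at the core of speculative sampling. It is not taken from this repository. The function name `speculative_step` and its parameters `p` (target-model probabilities), `q` (draft-model probabilities), and `rng` are hypothetical, and the sketch handles a single drafted token over a toy vocabulary rather than a full decoding loop.

```python
import random


def speculative_step(p, q, rng=random):
    """One accept/reject step for a single drafted token (illustrative only).

    p: target-model probabilities over the vocabulary (list of floats).
    q: draft-model probabilities over the same vocabulary.
    rng: the `random` module or a `random.Random` instance.
    Returns (token_index, accepted).
    """
    vocab = range(len(p))
    # The draft model proposes a token x ~ q.
    x = rng.choices(vocab, weights=q, k=1)[0]
    # Accept the proposal with probability min(1, p[x] / q[x]).
    if rng.random() < min(1.0, p[x] / q[x]):
        return x, True
    # On rejection, resample from the residual max(0, p - q);
    # rng.choices normalizes the weights itself.
    residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
    x = rng.choices(vocab, weights=residual, k=1)[0]
    return x, False
```

Under these assumptions, tokens returned by `speculative_step` follow the target distribution `p` even though proposals come from `q`. In a full implementation the draft model would propose several tokens per step and the target model would score them in one parallel pass.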

Primary language: Python. License: MIT.
