/SpeculativeDecoding

Pytorch and HF implementation of standard auto-regressive decoding and speculative decoding

Primary LanguagePython

Watchers