Pytorch and HF implementation of standard auto-regressive decoding and speculative decoding
Primary LanguagePython