/Kangaroo

Implementation of Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Primary LanguagePython