Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Primary LanguagePythonOtherNOASSERTION