Leap-of-Thought: Accelerating Transformers via Dynamic Token Routing (EMNLP 2023)
Primary Language: Python