Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
Primary LanguagePython