/FS-GEN

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.

Primary LanguagePython

Watchers