/speculative_decoding.c

minimal C implementation of speculative decoding based on llama2.c

Primary LanguageCMIT LicenseMIT

Watchers