Is there any benchmark that compares Sequoia against vanilla speculative decoding?

Question

KexinFeng opened this issue 7 months ago · 2 comments

Hi,

Thanks for the great work!

I'm wondering if there is any benchmark that compares Sequoia against vanilla speculative decoding?

Answer 1 · 2024-04-12T05:37:09.000Z

Once you get your acceptance rate, I think you can directly calculate the theoretical best gamma and speedup for vanilla speculative decoding.

Answer 2 · 2024-04-12T06:31:26.000Z

Hello, you can use 4-chain, 8-chain in L40_growmaps as growmaps, which will produce "tree structures" for vanilla speculative decoding. Thank you!