Issues
Work On CPU
#16 opened by ZepinLi - 8
Estimate the number of generated tokens per step from the acceptance-rate-vector?
#14 opened by KexinFeng - 3
Question on tree search algorithm
#15 opened by cyLi-Tiger - 2
Is there any benchmark that compares Sequoia against vanilla speculative decoding?
#10 opened by KexinFeng - 7
The support on vLLM?
#11 opened by KexinFeng - 0
Thanks for your good work.
#9 opened by xwjim - 0
data loading timing and disk use
#4 opened by poedator - 2
Integration with Lit-GPT
#3 opened by tchaton - 5