NVIDIA/TensorRT-LLM

[Feature]: support eagle3 speculative decoding for gemma3 on torch backend

lzyrapx opened this issue ยท 0 comments

๐Ÿš€ The feature, motivation and pitch

support eagle3 speculative decoding for gemma3 model on torch backend

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.