/flashinfer

FlashInfer: Kernel Library for LLM Serving

Primary LanguageCudaApache License 2.0Apache-2.0

Watchers