/PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Primary Language: C++
License: MIT
