/paella

Paella: Low-latency Model Serving with Virtualized GPU Scheduling

Primary LanguageC++MIT LicenseMIT

Stargazers