/paella

Paella: Low-latency Model Serving with Virtualized GPU Scheduling

Primary LanguageC++

Stargazers