/llm-scheduling-artifact

Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers