/vllm-ltr

[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers