/DistServe

Disaggregated serving system for Large Language Models (LLMs).

Primary Language: Jupyter Notebook
License: Apache License 2.0 (Apache-2.0)
