/swiftLLM

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers