/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers