/llama-box

LLM inference server implementation based on llama.cpp.

Primary LanguageC++MIT LicenseMIT

Watchers