A simple Docker/FastAPI wrapper around llama.cpp for running it in a Kubernetes (k8s) container.
Primary language: Dockerfile. License: MIT.
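
As a rough illustration of what a wrapper like this has to do, the sketch below shows a Python helper that a FastAPI route handler could call to run one llama.cpp completion. It is not taken from this repository. The binary path `LLAMA_BIN`, the model path `MODEL_PATH`, and the function names `build_command` and `complete` are all assumptions made for the example. Only the `-m`, `-p`, and `-n` options are actual llama.cpp CLI options.

```python
import subprocess
from typing import Callable, List, Sequence

# Assumption: location of a llama.cpp CLI binary inside the container image.
LLAMA_BIN = "/app/llama-cli"
# Assumption: location of a GGUF model file mounted into the container.
MODEL_PATH = "/models/model.gguf"


def build_command(prompt: str, max_tokens: int = 128) -> List[str]:
    """Build the argv list for a single llama.cpp completion run."""
    if not prompt:
        raise ValueError("prompt must not be empty")
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    # -m selects the model, -p passes the prompt, -n caps generated tokens.
    return [LLAMA_BIN, "-m", MODEL_PATH, "-p", prompt, "-n", str(max_tokens)]


def _run(cmd: Sequence[str]) -> str:
    """Run the command and return its stdout; raises on a non-zero exit."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout


def complete(
    prompt: str,
    max_tokens: int = 128,
    runner: Callable[[Sequence[str]], str] = _run,
) -> str:
    """Return the stripped model output for a prompt.

    The runner is injectable so the function can be exercised without
    a llama.cpp binary being present.
    """
    return runner(build_command(prompt, max_tokens)).strip()
```

Passing the arguments as a list (rather than a shell string) keeps the prompt from being interpreted by a shell, which matters once the prompt arrives over HTTP. In a FastAPI app, a POST route handler could simply call `complete(...)` with the request's prompt and return the result as JSON.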