dwarvesf/llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
Python
No issues in this repository yet.