Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip
Primary LanguagePythonMIT LicenseMIT
No issues in this repository yet.