/infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

Primary LanguagePythonMIT LicenseMIT

Issues