michaelfeil/infinity
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali.
Python · MIT