Pinned Repositories
infinity
Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip
Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
pipableAI
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
shubham-bnxt's Repositories
shubham-bnxt doesn’t have any repository yet.