Pinned Repositories
BentoML
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
OpenLLM
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
text-generation-inference
Large Language Model Text Generation Inference
langchain
🦜🔗 Build context-aware reasoning applications
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
txtai
💡 Build AI-powered semantic search applications
mistral-inference
Official inference library for Mistral models
mlc-llm
Universal LLM Deployment Engine with ML Compilation
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Matthieu-Tinycoaching's Repositories
Matthieu-Tinycoaching/txtai
💡 Build AI-powered semantic search applications