A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

RAGHub: A Directory of Tools for Retrieval-Augmented Generation (RAG)

Welcome to RAGHub, a living collection of new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. This is a community-driven project for r/RAG, where we aim to catalog the rapid growth of RAG tools and projects that are pushing the boundaries of the field.

Each day, it feels like a new tool or framework emerges, and choosing the right one is becoming more of an art than a science. Is the framework from three months ago still relevant? Or was it just hype, rehashing old concepts with a fresh look? RAGHub exists to help you stay ahead of these changes, providing a platform for the latest innovations in RAG.

How to Contribute

This is a community project, and we welcome contributions from everyone! If you’d like to add a new framework, project, or resource, please check out our Contribution Guidelines for details on how to get started.

RAG Frameworks

| Name | Description | Website | GitHub | Stars | Activity |
|---|---|---|---|---|---|
| LangChain | A framework for building applications with LLMs | Website | GitHub | 93.2k | 9h ago |
| Haystack | A framework for building search engines using neural networks | Website | GitHub | 17k | Last week |
| LlamaIndex | A framework for building data-driven LLM applications | Website | GitHub | 35.9k | 7h ago |
| BentoML | Build Inference APIs, LLM apps, Multi-model chains, RAG service | Website | GitHub | 7k | 1h ago |
| LightRAG | Simple and fast Retrieval-Augmented Generation | Website | GitHub | 268 | 1d ago |
| Swarm by OpenAI | Educational framework for lightweight multi-agent orchestration | - | GitHub | 7k | 1d ago |
| Langroid | Python framework to easily build LLM-powered applications | Website | GitHub | 2.4k | 10h ago |
| NeMo-Guardrails | Toolkit for adding programmable guardrails to LLM-based applications | Website | GitHub | 4k | Last week |
| Swiftide | A Rust library for building fast, streaming applications with LLMs | Website | GitHub | 222 | 1h ago |
| Korvus | The entire RAG pipeline in a single database query | Website | GitHub | 1.3k | Last month |
| semantic-router | A framework for routing LLM requests using semantic vectors. | Website | GitHub | 2k | 4h ago |
| AWS Bedrock Knowledge Bases | Service to build, scale, and deploy RAG-powered applications | Website | - | - | 1h ago |
| langflow | A framework to build, scale, and deploy RAG and multi-agent AI apps. | Website | GitHub | 31.4k | 1h ago |
| dspy | A framework to build language model apps with modular programming. | Website | GitHub | 17.8k | 13h ago |
| mem0 | The Memory layer for your AI apps. | Website | GitHub | 22.6k | 2h ago |

RAG Evaluation and Optimization Frameworks

| Name | Description | Website | GitHub | Stars | Activity |
|---|---|---|---|---|---|
| Trulens | Measures and enhances LLM app quality with feedback functions for scalable evaluation | Website | GitHub | 2.1k | 11h ago |
| Phoenix | AI observability platform designed for experimentation, evaluation, and troubleshooting | Website | GitHub | 3.6k | 1d ago |
| ragas | Evaluates and quantifies the performance of RAG pipelines that enhance LLM context with external data | Website | GitHub | 6.8k | 3h ago |
| Deepchecks | Continuous validation of AI & ML models, detecting data drift and model issues | Website | GitHub | 3.6k | 8m ago |
| AutoRAG | End-to-end RAG optimization: parsing, chunking, evaluation dataset creation, and pipeline deployment | Website | GitHub | 1.6k | 1h ago |
| evalmy.ai | Fine-tuned lightweight RAG evaluation service + Python client lib | Website | GitHub | - | - |
| TextGrad | A framework for LLM-based text optimization, focusing on reducing hallucinations and improving prompts. | Website | GitHub | 1.7k | 24h ago |
| langfuse | Traces, evals, prompt management and metrics to debug and improve your LLM application. | Website | GitHub | 6.9k | 1h ago |

RAG Engines

| Name | Description | Website | GitHub | Stars | Activity |
|---|---|---|---|---|---|
| TrustGraph | LLM Agnostic Agent Development Platform | Website | GitHub | 120 | 2d ago |
| R2R | The Elasticsearch for RAG, helps you quickly build and launch scalable RAG solutions | Website | GitHub | 3.4k | 6h ago |
| RAGFlow | Open-source RAG engine based on deep document understanding | Website | GitHub | 18.7k | 1h ago |
| Vertex AI Knowledge Engine | A data framework for context-augmented LLM applications | Website | - | - | 1d ago |
| Embedchain | Open Source Framework for personalizing LLM responses under 10 lines of code | Website | GitHub | 22.2k | Last week |
| txtai | All-in-one embeddings database for semantic search, LLM orchestration, and RAG workflows | Website | GitHub | 8.8k | Last week |
| dsRAG | High-performance retrieval engine for unstructured data | - | GitHub | 815 | Last week |
| Flash-Rank | Use Pairwise or Listwise rerankers to improve search accuracy before passing to LLMs. | - | GitHub | 606 | 2w ago |
| Graphlit | API-first platform for building knowledge-driven AI applications and agents | Website | GitHub | 16 | 8h ago |
| rag-citation | Combines RAG with automatic citation generation to enhance content credibility | Website | GitHub | 6 | Last week |
| PostgresML | Postgres + GPUs with functions for chunking, embedding, transforming and ranking | Website | GitHub | 6k | Yesterday |
| chainlit | Build production-ready Conversational AI applications in minutes, not weeks | Website | GitHub | 7k | 24h ago |
| pathway | Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. | Website | GitHub | 4.3k | 7h ago |
| cognita | RAG framework for modular, open-source production apps. | Website | GitHub | 3.3k | 2d ago |
| FlashRAG | A Python Toolkit for Efficient RAG Research | - | GitHub | 1.3k | 3h ago |
| RAGatouille | Easily train and use advanced retrieval methods in any RAG pipeline. | - | GitHub | 3.1k | 4 months ago |

RAG Projects

| Name | Description | Website | GitHub | Stars | Activity |
|---|---|---|---|---|---|
| LlamaParse | GenAI-native document parsing platform | Website | GitHub | 2.8k | 2d ago |
| Langchain-extract | Web server to extract information from text and files using LLMs | Website | GitHub | 1k | 4m ago |
| Needle | Production ready RAG pipelines out of the box. | Website | GitHub | 15 | 1h ago |
| Unstructured.io | Build custom preprocessing pipelines for labeling, training, or production ML | Website | GitHub | 8.7k | 3d ago |
| Verba | RAG chatbot powered by Weaviate | Website | GitHub | 6.1k | 2w ago |
| Unstract | No-code Platform to launch APIs and ETL Pipelines to structure unstructured documents | Website | GitHub | 2.3k | 4h ago |
| Humata.ai | Ask questions across all of your document files | Website | - | - | 4h ago |
| Ragie.ai | Fully managed RAG-as-a-Service for developers. | Website | GitHub | 12 | 12h ago |
| Reducto | Parses complex documents and creates LLM-ready inputs | Website | GitHub | 16 | 2w ago |
| Midship | Extract document data straight into your spreadsheet/ERP/CRM | Website | GitHub | - | - |
| DocuPanda | Convert documents into a structured, standard set of fields and values | Website | - | - | - |
| contextual-doc-retrieval-opneai-reranker | Using GPT-4 and Cohere for query expansion and re-ranking with BM25 | - | GitHub | 20 | Last week |
| Raggenie | Low-code platform to build custom RAG-based AI applications | Website | GitHub | 60 | 10h ago |
| Chunkr | Vision model based PDF chunking and OCR, optimized for fast processing of large datasets | Website | GitHub | 651 | 11h ago |
| tldw | Open-source project similar to NotebookLM | Website | GitHub | 267 | Yesterday |
| Cerbos | Access control for RAG and LLMs. | Website | GitHub | 3.2k | 14h ago |
| extractous | Extremely fast data extraction for your AI applications | Website | GitHub | - | - |
| SWIRL | AI Search & RAG for your workplace. Get AI insights from your company's knowledge instantly. | Website | GitHub | 2.1k | 2w ago |

RAG Resources and Sites

| Site/Article | Description | Link |
|---|---|---|
| Contextual Retrieval | Anthropic introducing Contextual Retrieval | Website |
| Open-RAG | Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Website |
| ColPali | Efficient Document Retrieval with Vision Language Models | Website |
| RAG_Techniques | Showcases various advanced techniques for RAG systems | Website |
| GenAI_Agents | Tutorials and implementations for various AI Agent techniques | Website |

Model Leaderboards

| Name | Description | Link |
|---|---|---|
| Artificial Analysis | LLM Comparison | Website |
| HuggingFace/mteb | Embedding models leaderboard | Website |

If you're looking for mainstream RAG frameworks and techniques, check out the excellent repository by Nir Diamant: RAG Techniques. This repository focuses on more established tools and methods that have already gained traction in the community.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Join the Conversation

This project is part of the r/RAG community. Have feedback or suggestions? Feel free to open an issue, start a discussion, or join the conversation on our Discord server! We want to make this repository a valuable resource for everyone exploring the RAG ecosystem, and your input is crucial.