retrieval-augmented-generation

There are 561 repositories under retrieval-augmented-generation topic.

  • chatchat-space/Langchain-Chatchat

    Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain

    Language:Python28.8k2673.3k5.1k
  • haystack

    deepset-ai/haystack

    :mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

    Language:Python14.2k1293.3k1.7k
  • infiniflow/ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

    Language:Python9k57539842
  • txtai

    neuml/txtai

    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

    Language:Python7.3k79696509
  • FlagOpen/FlagEmbedding

    Retrieval and Retrieval-augmented LLMs

    Language:Python5.6k33778390
  • TaskingAI

    TaskingAI/TaskingAI

    The open source platform for AI-native application development.

    Language:Python5.4k6270260
  • storm

    stanford-oval/storm

    An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

    Language:Python4.6k3929452
  • llmware-ai/llmware

    Unified framework for building enterprise RAG pipelines with small, specialized models

    Language:Python4.1k42116809
  • llm-app

    pathwaycom/llm-app

    LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker,⚡in sync with your data sources.

  • cognita

    truefoundry/cognita

    RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

    Language:Python2.6k2913200
  • infiniflow/infinity

    The AI-native database built for LLM applications, providing incredibly fast full-text and vector search

    Language:C++2k25271157
  • vearch

    vearch/vearch

    Distributed vector search for AI-native applications

    Language:Go2k76572317
  • langroid

    langroid/langroid

    Harness LLMs with Multi-Agent Programming

    Language:Python1.8k17141172
  • NVIDIA/GenerativeAIExamples

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Language:Python1.7k5132276
  • SciPhi-AI/R2R

    Build and deploy a fully-featured, observable, user-facing RAG backend in minutes.

    Language:HTML1.3k1035116
  • devflowinc/trieve

    All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.

    Language:Rust1.1k955495
  • kingjulio8238/memary

    Longterm Memory for Autonomous Agents.

    Language:Jupyter Notebook1.1k142275
  • safevideo/autollm

    Ship RAG based LLM web apps in seconds.

    Language:Python93315091
  • qdrant/fastembed

    Fast, Accurate, Lightweight Python library to make State of the Art Embedding

    Language:Python91999770
  • Neurite

    satellitecomponent/Neurite

    Fractal Graph-of-Thought. Experimental Mind-Map for Ai-Agents, Web-Links, Notes, and Code.

    Language:JavaScript852291670
  • pchunduri6/rag-demystified

    An LLM-powered advanced RAG pipeline built from scratch

    Language:Python7455644
  • RUC-NLPIR/FlashRAG

    ⚡FlashRAG: A Python Toolkit for Efficient RAG Research

    Language:Python71872254
  • jxzhangjhu/Awesome-LLM-RAG

    Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

  • parthsarthi03/raptor

    The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

    Language:Python59693179
  • serverless-chat-langchainjs

    Azure-Samples/serverless-chat-langchainjs

    Build your own serverless AI Chat with Retrieval-Augmented-Generation using LangChain.js, TypeScript and Azure

    Language:TypeScript5491733220
  • redis-developer/ArXivChatGuru

    Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

    Language:Python50991368
  • louisfb01/start-llms

    A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

  • eugeneyan/obsidian-copilot

    🤖 A prototype assistant for writing and thinking

    Language:Python4577832
  • philschmid/clipper.js

    HTML to Markdown converter and crawler.

    Language:TypeScript4524628
  • BaranziniLab/KG_RAG

    Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

    Language:Jupyter Notebook444151264
  • EvilPsyCHo/Play-with-LLMs

    Tutorial on training, evaluating LLM, as well as utilizing RAG, Agent, Chain to build entertaining applications with LLMs.分享如何训练、评估LLMs,如何基于RAG、Agent、Chain构建有趣的LLMs应用。

    Language:Jupyter Notebook44371974
  • charent/Phi2-mini-Chinese

    Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

    Language:Jupyter Notebook40981244
  • jonfairbanks/local-rag

    Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.

    Language:Python40691645
  • snexus/llm-search

    Querying local documents, powered by LLM

    Language:Jupyter Notebook402114247
  • SciPhi-AI/agent-search

    AgentSearch is a framework for powering search agents and enabling customizable local search.

    Language:Python3933542
  • Clarifai/clarifai-python

    Experience the power of Clarifai’s AI platform with the python SDK. 🌟 Star to support our work!

    Language:Python3924369118