rag-evaluation

There are 28 repositories under rag-evaluation topic.

Giskard-AI/giskard-oss
🐢 Open-Source Evaluation & Testing library for LLM Agents
Language:Python4.9k 37 493362
Marker-Inc-Korea/AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Language:Python4.3k 33 630348
Agenta-AI/agenta
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
Language:Python3.2k 28 737376
frutik/Awesome-RAG
353 9 429
vectara/open-rag-eval
Open source RAG evaluation package
Language:Python30617
LLAMATOR-Core/llamator
Framework for testing vulnerabilities of large language models (LLM).
Language:Python15514
mts-ai/rurage
Language:Python33 2 10
oztrkoguz/RAG-Framework-Evaluation
This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.
Language:Python14 2 00
ioannis-papadimitriou/rag-playground
A framework for systematic evaluation of retrieval strategies and prompt engineering in RAG systems, featuring an interactive chat interface for document analysis.
Language:Python9 2 02
rostyslavshovak/RAG-Retrieval-Augmented-Generation
RAG Chatbot for Financial Analysis
Language:Python8 1 00
simranjeet97/Learn_RAG_from_Scratch_LLM
Learn Retrieval-Augmented Generation (RAG) from Scratch using LLMs from Hugging Face and Langchain or Python
Language:Jupyter Notebook6 1 04
shaadclt/EvalRAG
A comprehensive evaluation toolkit for assessing Retrieval-Augmented Generation (RAG) outputs using linguistic, semantic, and fairness metrics
Language:Python4
fkapsahili/EntRAG
EntRAG - Enterprise RAG Benchmark
Language:Python3
bluewave-labs/evalwise
EvalWise is a developer-friendly platform for LLM evaluation and red teaming that helps test AI models for safety, compliance, and performance issues
Language:Python2
264Gaurav/medical-RAG-chatbot
A LangChain-based Retrieval-Augmented Generation (RAG) chatbot for medical data. Integrates with Gemini/Grok AI to deliver accurate, context-aware answers in healthcare and biomedical domains.
Language:Jupyter Notebook1
AnasAber/MLflow_with_RAG
Using MLflow to deploy your RAG pipeline, using LLamaIndex, Langchain and Ollama/HuggingfaceLLMs/Groq
Language:Python1 1 11
Kaos599/BetterRAG
BetterRAG: Powerful RAG evaluation toolkit for LLMs. Measure, analyze, and optimize how your AI processes text chunks with precision metrics. Perfect for RAG systems, document processing, and embedding quality assessment.
Language:Python1
sprakash21/aws-genai-rageval-bot
RAG Pipeline Evaluation and monitoring on AWS using RAGAS
Language:Python1
dfavenfre/RAG-Optimization
Language:Jupyter Notebook0 1 00
Gian207/RAG-lego-like-component
Proposal for industry RAG evaluation: Generative Universal Evaluation of LLMs and Information retrieval
Language:Python0 1 00
keitabroadwater/llm-eval-lab
A web sandbox for hands-on learning of LLM and RAG Evaluation
Language:TypeScript00
TajaKuzman/pandachat-rag-benchmark
PandaChat-RAG benchmark for evaluation of RAG systems on a non-synthetic Slovenian test dataset.
Language:Python0 1 00
igorsuhinin/rag-pdf-qa
RAG-powered PDF QA system with self-reflection and multiple retrieval strategies (Stuff/Map Reduce/Refine). Includes monitoring via Langfuse & LangSmith and containerization with Docker
Language:Python
jhaayush2004/RAG-Evaluation
Different approaches to evaluate RAG !!!
Language:Jupyter Notebook1 0
marktr11/RAG-Pipeline-LLM-Evaluation
A basic RAG (Retrieval-Augmented Generation) implementation and evaluation methodology built with Python.
Language:Jupyter Notebook
neomatrix369/AIE7-Cert-Challenge
AIE7: Certification Challenge
Language:Jupyter Notebook
OranDanon/Gen-AI-Assignment
Home assignment featuring two AI projects: a Medical Q&A Bot for Israeli HMOs and a National Insurance Form Extractor. Built with Azure OpenAI to demonstrate practical GenAI implementation skills.
Language:Python
OranDanon/RAG-application
RAG Chatbot over pre-defined set of articles about LangChain
Language:Python

rag-evaluation

Giskard-AI/giskard-oss

Marker-Inc-Korea/AutoRAG

Agenta-AI/agenta

frutik/Awesome-RAG

vectara/open-rag-eval

LLAMATOR-Core/llamator

mts-ai/rurage

oztrkoguz/RAG-Framework-Evaluation

ioannis-papadimitriou/rag-playground

rostyslavshovak/RAG-Retrieval-Augmented-Generation

simranjeet97/Learn_RAG_from_Scratch_LLM

shaadclt/EvalRAG

fkapsahili/EntRAG

bluewave-labs/evalwise

264Gaurav/medical-RAG-chatbot

AnasAber/MLflow_with_RAG

Kaos599/BetterRAG

sprakash21/aws-genai-rageval-bot

dfavenfre/RAG-Optimization

Gian207/RAG-lego-like-component

keitabroadwater/llm-eval-lab

TajaKuzman/pandachat-rag-benchmark

igorsuhinin/rag-pdf-qa

jhaayush2004/RAG-Evaluation

marktr11/RAG-Pipeline-LLM-Evaluation

neomatrix369/AIE7-Cert-Challenge

OranDanon/Gen-AI-Assignment

OranDanon/RAG-application