llm-evaluation-metrics
There are 3 repositories under the llm-evaluation-metrics topic.
confident-ai/deepeval
The LLM Evaluation Framework
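For context, deepeval exposes a pytest-style API for scoring LLM outputs against metrics such as answer relevancy. The snippet below is a minimal sketch based on the project's documented quick-start (LLMTestCase plus AnswerRelevancyMetric); the question and answer strings are made-up placeholders, and the metric's judge model needs an LLM backend (an OpenAI API key by default).

    from deepeval import evaluate
    from deepeval.metrics import AnswerRelevancyMetric
    from deepeval.test_case import LLMTestCase

    # One test case: the prompt sent to your LLM and the output it produced.
    # Both strings here are placeholder values for illustration.
    test_case = LLMTestCase(
        input="What is the capital of France?",
        actual_output="The capital of France is Paris.",
    )

    # Score how relevant the answer is to the question; the test case
    # fails if the score falls below the threshold.
    metric = AnswerRelevancyMetric(threshold=0.7)

    # Run the metric over the test case and print a pass/fail report.
    evaluate(test_cases=[test_case], metrics=[metric])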
zhuohaoyu/KIEval
[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
ritwickbhargav80/quick-llm-model-evaluations
This repo contains a Streamlit application that provides a user-friendly interface for evaluating large language models (LLMs) using the beyondllm package.