/eval_llm

Use a trusted LLM to evaluate new LLM's answers, given datasets and evaluation criteria

Primary LanguagePythonGNU Affero General Public License v3.0AGPL-3.0

Stargazers