Use a trusted LLM to evaluate a new LLM's answers against given datasets and evaluation criteria.
Primary language: Python. License: GNU Affero General Public License v3.0 (AGPL-3.0).
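The idea of having a trusted model grade a new model can be illustrated with a minimal sketch. This is not the repository's actual code or API: the names `build_judge_prompt`, `parse_score`, `evaluate`, the 1-5 score scale, the `Score: <n>` reply format, and the dataset layout (`question` plus an optional `reference`) are all assumptions made for illustration. Both models are represented as plain callables that map a prompt string to a reply string, and the stubs below stand in for real LLM clients.

```python
import re
from typing import Callable, Dict, List, Optional

# Assumption: any LLM is modelled as a function from prompt text to reply text.
LLM = Callable[[str], str]


def build_judge_prompt(question: str, answer: str, criteria: str,
                       reference: Optional[str] = None) -> str:
    """Assemble the grading prompt sent to the trusted (judge) model."""
    parts = [
        "You are grading another model's answer.",
        f"Criteria: {criteria}",
        f"Question: {question}",
    ]
    if reference is not None:
        parts.append(f"Reference answer: {reference}")
    parts.append(f"Candidate answer: {answer}")
    parts.append("Reply with 'Score: <1-5>' followed by a one-line reason.")
    return "\n".join(parts)


def parse_score(judge_reply: str) -> Optional[int]:
    """Extract the 1-5 score from the judge's reply, or None if absent."""
    match = re.search(r"Score:\s*([1-5])\b", judge_reply)
    return int(match.group(1)) if match else None


def evaluate(dataset: List[Dict[str, str]], candidate: LLM, judge: LLM,
             criteria: str) -> List[Dict[str, object]]:
    """Ask the candidate each question, then have the judge score the answer."""
    results = []
    for item in dataset:
        answer = candidate(item["question"])
        prompt = build_judge_prompt(item["question"], answer, criteria,
                                    item.get("reference"))
        reply = judge(prompt)
        results.append({
            "question": item["question"],
            "answer": answer,
            "score": parse_score(reply),
            "judge_reply": reply,
        })
    return results


# Hypothetical stand-ins for real model clients, so the sketch runs offline.
def stub_candidate(prompt: str) -> str:
    return "4" if "2 + 2" in prompt else "I don't know"


def stub_judge(prompt: str) -> str:
    # Toy rule: top score only when the candidate answer equals the reference.
    ref = re.search(r"Reference answer: (.*)", prompt)
    ans = re.search(r"Candidate answer: (.*)", prompt)
    ok = bool(ref and ans and ref.group(1) == ans.group(1))
    return "Score: 5 - matches reference" if ok else "Score: 1 - does not match"


dataset = [
    {"question": "What is 2 + 2?", "reference": "4"},
    {"question": "What is the capital of France?", "reference": "Paris"},
]

for row in evaluate(dataset, stub_candidate, stub_judge, "factual correctness"):
    print(row["question"], "->", row["score"])
```

In practice the two stubs would be replaced by calls to the trusted and candidate models, and unparseable judge replies (where `parse_score` returns `None`) would need a retry or manual review policy.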