A simple implementation of LLM-as-a-judge for pairwise evaluation of Q&A models.
Primary language: Python
License: GNU General Public License v3.0 (GPL-3.0)
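To make "pairwise evaluation" concrete, here is a minimal sketch of one common way to do it. It is not the repository's actual code. The prompt template, the function names (`parse_verdict`, `judge_pair`, `evaluate`), and the `judge` callable (any function that takes a prompt string and returns the judge model's text reply) are all assumptions made for illustration. The sketch asks the judge twice with the answer order swapped and counts a win only when both verdicts agree, a commonly used guard against position bias.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, Tuple

# Hypothetical prompt template; not taken from the repository.
PROMPT = (
    "You are judging two answers to the same question.\n"
    "Question: {question}\n"
    "Answer A: {a}\n"
    "Answer B: {b}\n"
    "Reply with exactly one of: A, B, TIE."
)


def parse_verdict(raw: str) -> str:
    """Map a raw judge reply to 'A', 'B', or 'TIE' (unparseable replies become 'TIE')."""
    token = raw.strip().upper()
    return token if token in {"A", "B", "TIE"} else "TIE"


def judge_pair(
    judge: Callable[[str], str], question: str, answer_1: str, answer_2: str
) -> str:
    """Return 'model_1', 'model_2', or 'tie' for a single question.

    The judge is queried twice with the answer order swapped; a model wins
    only if it is preferred in both orderings.
    """
    first = parse_verdict(judge(PROMPT.format(question=question, a=answer_1, b=answer_2)))
    second = parse_verdict(judge(PROMPT.format(question=question, a=answer_2, b=answer_1)))
    if first == "A" and second == "B":
        return "model_1"
    if first == "B" and second == "A":
        return "model_2"
    return "tie"


def evaluate(
    judge: Callable[[str], str], examples: Iterable[Tuple[str, str, str]]
) -> Dict[str, int]:
    """Tally outcomes over (question, answer_1, answer_2) triples."""
    tally = Counter({"model_1": 0, "model_2": 0, "tie": 0})
    for question, answer_1, answer_2 in examples:
        tally[judge_pair(judge, question, answer_1, answer_2)] += 1
    return dict(tally)
```

In practice `judge` would wrap a call to whichever LLM serves as the judge. Keeping it as a plain callable lets the comparison logic be tested with a stub before any model is attached.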