/scaleeval

Scalable Meta-Evaluation of LLMs as Evaluators

Primary LanguagePython

Watchers