/uncheatable_eval

Evaluating LLMs with Dynamic Data

Primary Language: Jupyter Notebook
License: MIT
