Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
Primary LanguagePythonGNU General Public License v3.0GPL-3.0