/NPHardEval

Repository for NPHardEval, a quantified-dynamic benchmark of LLMs

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0