Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
Primary LanguageJupyter NotebookApache License 2.0Apache-2.0