techandy42/bug_in_the_code_stack_v2
Can LLMs find bugs that compilers can't?: A benchmark for measuring LLMs' capabilities in debugging large source code.
Jupyter Notebook
No issues in this repository yet.