Crashbench is a benchmark that measures the bug-finding and bug-reporting capabilities of LLMs.
Primary language: C
License: GNU General Public License v3.0 (GPL-3.0)