/crashbench

Crashbench is a benchmark that measures the bug-finding and bug-reporting capabilities of LLMs.

Primary Language: C
License: GNU General Public License v3.0 (GPL-3.0)
