/BIG-Bench-Mistake

A dataset of LLM-generated chain-of-thought steps annotated with mistake location.

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers