A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
Primary LanguagePythonApache License 2.0Apache-2.0