Brainstorming OpenAI Evals for investigating the effects and mechanisms of chain-of-thought prompting.