suzgunmirac/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
MIT
Issues
- 0
- 0
- 0
HuggingFace Dataset
#8 opened by MaveriQ - 0
- 2
- 1
- 3
Potential typos in CoT prompts
#4 opened by samar-khanna - 2
PaLM predictions?
#2 opened by jmhessel - 1
How to find evaluation metric?
#1 opened by darvincy