suzgunmirac/BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

MIT

Issues

New, harder reasoning problems for longer contexts
#11 opened 4 months ago by tehruhn
0
Potential for LongScope to be added to BigBench Hard?
#9 opened a year ago by mrconter1
0
HuggingFace Dataset
#8 opened a year ago by MaveriQ
0
Can you share the script to build the dataset?
#7 opened a year ago by imoneoi
0
causal judgment answer keys with acceptable rationales?
#5 opened 2 years ago by i-am-neo
2
Duplicated inputs with conflicting targets in `causal_judgement.json`
#3 opened 2 years ago by gabrielStanovsky
1
Potential typos in CoT prompts
#4 opened 2 years ago by samar-khanna
3
PaLM predictions?
#2 opened 2 years ago by jmhessel
2
How to find evaluation metric?
#1 opened 2 years ago by darvincy
1