- add uncertain protocol
- Change ref away from davinci
- fix the bug in the highconf protocol that leads to vastly prefer the first answer
- more epochs/data --> so that dumb labels are not so good, maybe always start by ft on dumb to get the right fmt?
- better red team spurious cues
- shut downs
- add spurious cue detection