bigscience-workshop/evaluation

Add WinoMT to Full Benchmark

epavlick opened this issue · 2 comments

I can help with this!

I am working on a similar dataset, WinoBias, which evaluates gender bias in coreference resolution. Its prompts were already available in promptsource.