bigscience-workshop/evaluation

Add HuffPo Text Classification to Full Benchmark

epavlick opened this issue · 3 comments

use to test generalization to unseen labels; maybe use FLEX?

I will do this

I would like to help in this.