eric11eca/curriculum-ling
Curriculum is a new format of NLI benchmark for evaluation of broad-coverage linguistic phenomena. This linguistic-phenomena-driven benchmark can serve as an effective tool for diagnosing model behavior and verifying model learning quality.
Jupyter NotebookMIT
Issues
- 2
Where can I find the benchmark?
#1 opened by BKHMSI