/specificityplus

👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"

Primary language: Python. License: Other (NOASSERTION).
