/honest

A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.

Primary Language: Python. License: MIT.
