Pinned Repositories
awesome-human-label-variation
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation (EMNLP 2022)
annotation-paradigms
Röttger et al. (NAACL 2022): "Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks"
efficient-low-resource-hate-detection
Röttger et al. (EMNLP 2022): "Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages"
exaggerated-safety
Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
hatecheck-data
Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
hatecheck-experiments
Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code
llm-values-pct
multiq
safetyprompts-paper
temporal-adaptation
Röttger and Pierrehumbert (EMNLP 2021 Findings): "Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media"
paul-rottger's Repositories
paul-rottger/exaggerated-safety
Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
paul-rottger/hatecheck-data
Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
paul-rottger/hatecheck-experiments
Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code
paul-rottger/efficient-low-resource-hate-detection
Röttger et al. (EMNLP 2022): "Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages"
paul-rottger/multiq
paul-rottger/temporal-adaptation
Röttger and Pierrehumbert (EMNLP 2021 Findings): "Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media"
paul-rottger/annotation-paradigms
Röttger et al. (NAACL 2022): "Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks"
paul-rottger/llm-values-pct
paul-rottger/safetyprompts-paper