/moral-evals

Primary language: HTML
License: MIT

Procedural Dilemma Generation for Moral Reasoning in Humans and Language Models

Causal Template -> Prompt Template -> Test Items

🧐 What is this?

This is a supporting repository for our paper "Procedural Dilemma Generation for Moral Reasoning in Humans and Language Models" (2024, CogSci).

Preregistrations

Preregistrations for all experiments are available on the Open Science Framework (OSF):

Repository structure

├── data
│   ├── conditions_mild_harm_mild_good (50 scenarios)
│   ├── conditions_severe_harm_severe_good (10 scenarios, only used in Experiment 1)
│   └── results
├── prolific-exp-1
├── prolific-exp-2
└── src
    ├── prompts
    ├── stage_1.py
    └── stage_2.py
  • data contains the condition sets. conditions_mild_harm_mild_good includes conditions with matched mild harm and mild good outcomes. We used the first 10 scenarios from each condition for our comparison in the paper (80 items). We also include conditions_severe_harm_severe_good, which includes conditions with matched severe harm and severe good outcomes.
    • results contains model responses, including chain-of-thought examples
  • prolific-exp-1 and prolific-exp-2 include the experimental stimuli and formatted participant data frames (_long_format.csv)
  • src includes prompts and code (stage_1.py and stage_2.py) for generating items.
  • docs contains all the experiment code. You can preview the experiments below:
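
Returning to the data folder described in the list above: the 80-item figure is consistent with eight conditions of 10 scenarios each. The following is a minimal sketch of that selection, not the code used for the paper. It assumes, without confirmation from the repository, that each condition is stored as one CSV file with one scenario per row and no header row. The function names first_n_scenarios and count_items are hypothetical.

```python
import csv
from pathlib import Path


def first_n_scenarios(condition_dir, n=10):
    """Return {condition_name: first n rows} for every CSV file in condition_dir.

    Assumption (not confirmed by the repository): one CSV file per condition,
    one scenario per row, no header row.
    """
    selected = {}
    for csv_path in sorted(Path(condition_dir).glob("*.csv")):
        with csv_path.open(newline="", encoding="utf-8") as handle:
            rows = list(csv.reader(handle))
        # Keep only the first n scenarios of this condition.
        selected[csv_path.stem] = rows[:n]
    return selected


def count_items(selected):
    """Total number of selected items across all conditions."""
    return sum(len(rows) for rows in selected.values())
```

Under these assumptions, calling first_n_scenarios on a directory with eight condition files and passing the result to count_items would give 80, matching the item count reported above. If the files have a header row or use another format, the slicing step would need to be adjusted.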