This repo contains the supplementary materials for ACL 2021 paper: "All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text"

It contains:

Priming texts
Training materials

Priming texts

When generating the machine-generated texts with GPT2 and GPT3, we used a "three-shot" setting, conditioning each passage on 3 in-domain, human-authored texts. The three texts we used for each domain are in priming_texts.tsv.

Training materials

Two of our trainings (Examples and Comparison) showed the evaluators example passages, along with a brief explanation for each example. The exact passages used for training in each domain are in training_materials.tsv.

bshmueli/nlg_human_evals

Priming texts

Training materials