Google Search Results Datasets

This repository includes the dataset of search results accompanying Towards Understanding Gender-Seniority Compound Bias in Natural Language Generation (LREC 2022).

All data files are named as with the format domain-gender-seniority-samples.txt

If you use this dataset, please cite the paper

@misc{https://doi.org/10.48550/arxiv.2205.09830, doi = {10.48550/ARXIV.2205.09830},

url = {https://arxiv.org/abs/2205.09830},

author = {Honnavalli, Samhita and Parekh, Aesha and Ou, Lily and Groenwold, Sophie and Levy, Sharon and Ordonez, Vicente and Wang, William Yang},

keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},

title = {Towards Understanding Gender-Seniority Compound Bias in Natural Language Generation},

publisher = {arXiv},

year = {2022},

copyright = {arXiv.org perpetual, non-exclusive license} }