Evaluating-Gender-Bias-in-Spanish-Deep-Learning-Models: A Python repository from IsGarrido

General notes

Viewer V1 is here
Viewer V2 is here

Evaluate Categories

Setup

Create a conda env, install dependencies.

Run FillTemplate.py

Create a file with the templates that you want to test in TSV format. Ex templates_123.tsv.
Place your template file inside assets/data. Ex assets/data/templates_123.tsv or assets/data/Experiment name/templates_123.tsv.
(Optional) Create a TSV file with a the list of models that your experiment will run. Any Fill mask model on HuggingFace should work.
Run FillTemplate.py python src/FillTemplate.py 'Experiment name' 'templates_123.tsv' 'models.tsv' 10.
This will generate an output inside assets/result/Experiment name/FillTemplate. The interesting file is FillTemplate.json.

Generating the category file

Grab the result of the latest experiment in assets/result/Experiment name/FillTemplate/FillTemplate.json
Head to the Category Tool
(Optional) If you already had categorized adjetives, you can drop them in the Filled box so you only have to categorize the new words. If not just start clear.
Drag and drop Adjectives.json into the box with the same name.
Use the "New category" input to add as many categories as needed.
You will be categorizing the word that is pointed by an arrow. To move its category just click a column.
When you are done, just click the Save to file button. It will be saved in your browser's default downloads folder as dllas-categorizedwords.json. You can check and edit this json file to tweak category name or contents.

Changes will be saved as you work in localStorage. If you want to continue later, you can Drop again the Adjectives.json file and click the Load last session button to recover your changes.

Run EvaluateCategories.py

Move the categorization result from the previous phase ( dllas-categorizedwords.json ) it to the folder assets/data with some nice name. Ex Yulia.json.
Run python src/EvaluateCategories.py 'Experiment name' 'Yulia.json'. It will generate some result files on assets/result/Experiment name/EvaluateCategories

Replicate base experiment

git clone the project
Run FillTemplate.py without params
Run EvaluateCategories without params

cd dllas-evaluator
python src/FillTemplate.py
python src/EvaluateCategories.py

General notes on the folder structure

You should follow the folder convertion, you can check the folder structure of the default experiment:

The base folder is on the root of the projects, named assets. Inside we have two folders data and result.
data folder, for our input data. Inside you should add your data in the folder matching the script name, ex place some_templates.tsv inside FillTemplates folder.
result folder is for our experiment results. Every experiment should have a unique label that you will be passing as a cli param. Inside result we will have one folder for each experiment, not much to worry here as they will be created when you run the scripts.

Run the tests

chmod -x ./tests/TestEvaluateCategories.py
nosetests --with-watch --rednose --nologcapture src.tests

IsGarrido/Evaluating-Gender-Bias-in-Spanish-Deep-Learning-Models