This document outlines the data and computational requirements for participating in the AI-Vengers Federated Learning effort.
- Deep Learning Base AMI (Ubuntu 18.04)
- Test/Demo - t2.2xlarge
- 8 vCPU, 32GB memory, 100GB EBS disk, network performance “Moderate”
- At least 1 GPU
Mandatory Fields:
- Self Reported Race
- Gender
- Age
Optional Fields:
- Scanner Type
- Finding vs No Finding
0 - American Indian or Alaska Native
1 - Asian
2 - Black or African American
3 - Hispanic or Latino
4 - Native Hawaiian or Other Pacific Islander
5 - White
- Extract metadata and pngs from your DICOMs using Niffler PNG Extaction found here https://github.com/Emory-HITI/Niffler/tree/master/modules/png-extraction
- Prepare a CSV file with two columns ["Path", "Race"]
- Run the data preprocessing scripts which will output a JSON file for the train, test, and validation sets
python create_jsons.py [path/to/csv]
- Copy these JSON files into the data directory of your client
- Symlink your JSON file to a file named dataset.json
ln -s [json file] dataset.json