/mimic-disparities

Plots from "Can AI Help Reduce Disparities in General Medical and Mental Health Care?"

Primary LanguageJupyter Notebook

Disparities in MIMIC-III

Here we recreate plots from "Can AI Help Reduce Disparities in General Medical and Mental Health Care?" by Chen, Szolovits, and Ghassemi 2019 (AMA Journal of Ethics)

Because of data proprietary, we cannot share the psychiatric dataset. The same code is used for both datasets.

We demonstrate:

  1. Data hetereogeneity in the MIMIC clinical notes through LDA topic modeling and disparities in topics by race, gender, and insurance type
  2. Disparities in predictive accuracy by race, gender, and insurance type

Recreating results

  1. Get MIMIC notes from make_mimic_notes.py. You will need to adjust the username and location of MIMIC data.

  2. Get Mallet topics from the notes. We convert the notes into separate text files in make_mallet_data.py. We then run Mallet in run_mallet_topics.sh.

  3. Create plots in Recreate_Plots.ipynb

Requirements

  1. MIMIC data access
  2. Mallet for topic modeling
  3. Python packages listed in requirements.txt