/mit-elos

Based on https://github.com/yugangjia/ELOS-MIMICIV.git

Primary LanguageRMIT LicenseMIT

SOFA Components' and End-of-Life-Survey

Based on https://github.com/yugangjia/ELOS-MIMICIV.git

How to run this project?

1. Get the Data!

Both MIMIC and eICU data can be found in PhysioNet, a repository of freely-available medical research data, managed by the MIT Laboratory for Computational Physiology. Due to its sensitive nature, credentialing is required to access both datasets.

Documentation for MIMIC-IV's can be found here and for eICU here.

Integration with Google Cloud Platform (GCP)

In this section, we explain how to set up GCP and your environment in order to run SQL queries through GCP right from your local Python setting. Follow these steps:

  1. Create a Google account if you don't have one and go to Google Cloud Platform
  2. Enable the BigQuery API
  3. Create a Service Account, where you can download your JSON keys
  4. Place your JSON keys in the parent folder (for example) of your project
  5. Create a .env file with the command cp env.example env
  6. Update your .env file with your JSON keys path and the id of your project in BigQuery

MIMIC-IV

After getting credentialing at PhysioNet, you must sign the data use agreement and connect the database with GCP, either asking for permission or uploading the data to your project. Please note that only MIMIC v2.0 is available at GCP.

Having all the necessary tables for the cohort generation query in your project, run the following command to fetch the data as a dataframe that will be saved as CSV in your local project. Make sure you have all required files and folders.

python3 src/py/get_data.py --dataset "MIMIC"

This will create the file data/MIMIC_data.csv

With the following command, you can get the same cohort we used for the study:

source("src/r/cohort_MIMIC.R")

This will create the files data/cohorts/MIMIC_cohort.csv.

eICU-CRD

The rationale for eICU-CRD is similar. Run the following commands:

python3 src/py/get_data.py --dataset "eICU"

This creates the file data/eICU_data.csv

source("src/r/cohort_eICU.R")

This creates the files data/cohorts/eICU_cohort.csv.

2. Run the Statistical Models/Regressions

We made it really easy for you in this part. All you have to do is:

source("src/r/model.R")

And you'll get the results here: results/models

How to contribute?

We are actively working on this project. Feel free to raise questions opening an issue, send an email to jcmatos@mit.edu or to fork this project and submit a pull request! This project was done by Yugang Jia.