Prediction of hospital admission at emergency department using machine learning

Background

Emergency department (ED) visit is one of the most common ways to get medical support and ED represents the largest source of hospital admissions. In order to improve the running efficiency of ED, optimize the resource allocation, as well as to maximize the number of patients that get appropriate treatment, we train machine learning models to predict hospital admission at the time of ED triage using patients' triage information and previous medical history.

Note: This is the course project for 02-718 Computational Medicine in fall 2021

Data souorce

The Electronic Health Record (EHR) data we use was from a paper published on PLOS one. The original retrospective data was obtained from three Emergence Departments from March 2013 to July 2017, each ensuring one year of historical timeframe. We obtained the raw data from this Kaggle dataset.

Note: Because data is large (> 500M), which includes 560,486 patient visits with 972 variables, we didn't upload it under this repo. But we can send it to you if your are interested.

Workflow

workflow

File organization

  • EDA folder: includes scripts for EDA and some plots
  • Data folder: includes Feature_dexcription.xlsx file which has descriptions for each feature, and featureSelectedAllDataWithY.csv, which is the full data after feature selection.
  • comparison_plot folder: includes performance comparsion between four models
  • model folder: includes feature selection .py script, scripts for training four models and code for drawing comparsion plots
  • PlusOnePaper&Proposal folder: includes the original PLOS one paper and our project proposal

Authors