/Patient-Survival-Prediction-using-Deep-Learning

Deep learning based modeling approach to train large dataset and predict patient survival

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Patient-Survival-Prediction-using-Deep-Learning

Deep learning based modeling approach to train large dataset and predict patient survival

šŸ§¾Description: Getting a rapid understanding of the context of a patientā€™s overall health has been particularly important during the COVID-19 pandemic as healthcare workers around the world struggle with hospitals overloaded by patients in critical condition. Intensive Care Units (ICUs) often lack verified medical histories for incoming patients. A patient in distress or a patient who is brought in confused or unresponsive may not be able to provide information about chronic conditions such as heart disease, injuries, or diabetes. Medical records may take days to transfer, especially for a patient from another medical provider or system. Knowledge about chronic conditions can inform clinical decisions about patient care and ultimately improve patient's survival outcomes.

Source of the dataset: Click Here

šŸ§­ Problem Statement: The target feature is hospital_death which is a binary variable. The task is to classify this variable based on the other 185 features step-by-step by going through each day's task. The scoring metric is Accuracy/Area under ROC curve.

Web application:

Streamlit App

Key challange:

  1. Large dataset with over 90K rows and 186 columns.
  2. Lack of significant relations with target feature

Steps taken to solve the problem:

  1. Handled Missing Values using MCAR techniques
  2. EDA of the dataset to find distributions and relations of features
  3. Upsampling using SMOTE
  4. Feature selection using chi2 statistics
  5. Baseline modeling using neural network - Validation score - 50% AUC
  6. Fine tuned neural network - Validation score - 85% AUC
  7. Explainable AI usign shap Kernel explainer

Results:

  • Sub-samplling features and reducing data improved the modeling metric score
  • Using Keras-Tuner validation AUC score improved to 85% from 50%

Acknowledgement: TMLC Academy