/Predicting_Incident_HeartFailure_Disease_Based-on-Transformers

This repository hosts a cutting-edge deep learning model developed to predict 6-month incident heart failure utilizing electronic health records (EHRs). Heart failure is a multifaceted medical condition characterized by its significant impact on patients' well-being and healthcare systems.

Primary LanguagePython

HeartFailure_Disease_Based-on-Transformers

This project aims to predict the presence of heart disease based on various medical factors. The dataset used in this project contains several attributes such as age, sex, cholesterol levels, and exercise-induced angina, among others, which are used to train machine learning models.

Dataset

The dataset used in this project is named "heart.csv". It contains the following columns:

Age Sex Chest pain type Resting blood pressure Serum cholesterol Fasting blood sugar Resting electrocardiographic results Maximum heart rate achieved Exercise induced angina ST depression induced by exercise relative to rest Slope of the peak exercise ST segment Number of major vessels colored by fluoroscopy Thal

Getting Started

To run this project locally, follow these steps:

Clone this repository to your local machine.

Install the required dependencies using the following command:

pip install -r requirements.txt

Execute the code in a Python environment. The main script is named heart_disease_prediction.py.

Exploratory Data Analysis (EDA)

The script starts with loading the dataset and performing exploratory data analysis (EDA) tasks such as displaying the first few rows, statistical summary, and information about the dataset.

Visualization techniques such as count plots and bar plots are used to understand the distribution and relationships between variables.

Model Building and Evaluation

  • Several machine learning algorithms are used to build predictive models including: Logistic Regression Naive Bayes Support Vector Machine (SVM) K-Nearest Neighbors (KNN) Decision Tree Random Forest XGBoost Neural Network
  • Each model is trained on the training set and evaluated on the test set. Accuracy scores are computed and printed out.

Machine Learning Models:

Employ a range of machine learning algorithms, including Logistic Regression, Naive Bayes, Support Vector Machine, Decision Tree, Random Forest, and XGBoost. These models are trained and evaluated to gauge their effectiveness in predicting heart disease.

Deep Learning Model:

Develop a sophisticated neural network using Keras and TensorFlow tailored for heart disease classification. The deep learning model undergoes thorough preprocessing, normalization, architecture setup, training, and evaluation.

Toolkit Components:

Heart Disease Analyzer:

  • Conducts in-depth exploratory data analysis on the heart disease dataset, visualizing correlations, distributions, and patterns within the data.
  • Trains and evaluates machine learning models, including Random Forest, Logistic Regression, and Naive Bayes, for heart disease prediction.

Heart Disease Predictor:

  • Implements various machine learning algorithms, including Logistic Regression, Naive Bayes, Support Vector Machine, Decision Tree, Random Forest, XGBoost, and a Neural Network, for heart disease prediction.
  • Evaluates model performance using accuracy scores and cross-validation techniques.

Heart Disease ML Assistant:

  • Contains functions for data preprocessing, visualization, and machine learning model training.
  • Offers capabilities to preprocess the heart disease dataset, train machine learning models, and evaluate their performance.

Heart Disease Neural Navigator:

  • Utilizes advanced deep learning techniques to predict heart disease, constructing a powerful neural network model using Keras and TensorFlow.
  • Encompasses comprehensive preprocessing, normalization, model architecture setup, intensive training, and meticulous evaluation of the deep learning model.

How to Use:

  • Clone this repository to your local machine.
  • Ensure you have the necessary dependencies installed, which are specified in the requirements.txt file.
  • Execute the appropriate scripts based on your preferred analysis technique.
  • Follow the instructions provided within each script to preprocess the data, train the models, and evaluate their performance.

Results

  • The accuracy scores of different algorithms are plotted in a bar chart to visualize the performance of each model.

Contributors

Prem Chand Koru