/diabetes_prediction

Data Analysis and prediction of diabetic or non-diabetic in pregnant women.

Primary LanguageJupyter Notebook

Diabetes Prediction Notebook

A data analysis and predicting model of the PIMA Indians Diabetes dataset from Kaggle.

Live Demo

Dataset

Shape of the dataset: 768 rows and 9 columns

  • Pregnancies - Number of times pregnant
  • Glucose - Plasma glucose concentration a 2 hours in an oral glucose tolerance test
  • BloodPressure - Diastolic blood pressure (mm Hg)
  • SkinThickness - Triceps skin fold thickness (mm)
  • Insulin - 2-Hour serum insulin (mu U/ml)
  • BMI - Body mass index (weight in kg/(height in m)^2)
  • DiabetesPedigreeFunction - Diabetes pedigree function
  • Age - Age (years)
  • Outcome - Class variable (0 or 1) 268 of 768 are 1, the others are 0

Source

The original dataset is available at UCI Machine Learning Repository

EDA

Correlation Matrix

Histograms