Midterm

Members of the group:

  1. Feras Rafeh
  2. Bruna Santos
  3. Iranel González

We are doing a modeling project using logistic regression that predicts diabetes among patients.

Dataset Info:

Project Planning:

Day 1

  • EDA

Day 2

  • Data Cleaning:
  1. Remove typos
  2. Correct datatypes
  3. Replace or Drop NaN's

Day 3

  • Data Transformation:
  1. Split data into numerical and categorical
  2. Scale Numerical Features
  3. Encode Categorical Features

Day 4

  • Data Modelling
  1. Create and train the model
  2. Test the model