/Data-Science-Process-KNeighbors-Classifier---GA-Lab-4.01

Establishing/walking through the best-practice Data Science Process: 1. Define the Problem; 2. Obtain the data; 3. Explore the data; 4. Model the data; 5. Evaluate the model; 6. Answer the problem. Also, working with KNeighbors Classifier.

Primary LanguageJupyter Notebook

4.01-lab-classification_model_comparison

Week 4, Lab 1: Classification Model Comparison

Author: Matt Brems

Predicting Left-Handedness from Psychological Factors

We can sketch out the data science process as follows:

  1. Define the problem.
  2. Obtain the data.
  3. Explore the data.
  4. Model the data.
  5. Evaluate the model.
  6. Answer the problem.

We'll walk through a full data science problem in this lab. Specifically, we'll use a survey of psychological factors to predict whether or not individuals are left-handed. Along the way, we'll compare multiple classification models and see which best serve our needs.