/Credit_Risk_Analysis

Creating various machine learning models to create the most accurate model to predict credit risk

Primary LanguageJupyter Notebook

Credit_Risk_Analysis

Overview of Analysis

The purpose of this analysis is to create various machine learning models to see if I can create a model that accurately predicts credit risk.

Results

  1. Naive Random Oversampling
  • Accuracy score: 0.64

image

  • Precision score: 0.01
  • Recall/sensitivity score: 0.69

image

  1. SMOTE Oversampling
  • Accuracy score: 0.66

image

  • Precision score: 0.01
  • Recall/sensitivity score: 0.63

image

  1. Cluster Centroids
  • Accuracy score: 0.54

image

  • Precision score: 0.01
  • Recall/sensitivity score: 0.69

image

  1. SMOTEENN
  • Accuracy score: 0.67

image

  • Precision score: 0.01
  • Recall/sensitivity score: 0.76

image

  1. Balanced Random Forest
  • Accuracy score: 0.79

image

  • Precision score: 0.03
  • Recall/sensitivity score: 0.7

image

  1. Easy Ensemble AdaBoost
  • Accuracy score: 0.93

image

  • Precision score: 0.09
  • Recall/sensitivity score: 0.92

image

Summary

AdaBoost had the highest scores with an accuracy of 0.93, precision of 0.09, and sensitivity of 0.92. None of the models have a very good precision score, so I would not recommend them if you would like high precision in detecting credit risk. If you value sensitivity in your model, I would recommend using the AdaBoost as it had the highest sensitivity rating.