/loan_classification

ML Models to classify consumer data for loan approval

Primary LanguagePython

loan_classification

Creating various ML Models to classify consumer data for loan approval.

Kaggle dataset -> https://www.kaggle.com/ajay1735/hmeq-data

Context

The consumer credit department of a bank wants to automate the decisionmaking process for approval of home equity lines of credit. To do this, they will follow the recommendations of the Equal Credit Opportunity Act to create an empirically derived and statistically sound credit scoring model. The model will be based on data collected from recent applicants granted credit through the current process of loan underwriting. The model will be built from predictive modeling tools, but the created model must be sufficiently interpretable to provide a reason for any adverse actions (rejections).

Content

The Home Equity dataset (HMEQ) contains baseline and loan performance information for 5,960 recent home equity loans. The target (BAD) is a binary variable indicating whether an applicant eventually defaulted or was seriously delinquent. This adverse outcome occurred in 1,189 cases (20%). For each applicant, 12 input variables were recorded.

Models used

  • Logistic Regression
  • Random Forest

Results

Random Forest Classifier

Screen Shot 2022-07-03 at 4 29 07 PM

  • With Cross Validation

image

  • Confusion Matrix:

image

Logistic Regression

Screen Shot 2022-07-03 at 4 29 55 PM

  • With Cross Validation

image

  • Confusion Matrix:

Screen Shot 2022-07-03 at 4 30 35 PM