/Bank-Loan-Application-Status-Prediction

The loan default dataset has 8 variables and 850 records, each record being loan default status for each customer. Each Applicant was rated as “Defaulted” or “Not-Defaulted”. New applicants for loan application can also be evaluated on these 8 predictor variables and classified as a default or non-default based on predictor variables.

Primary LanguageJupyter Notebook

Bank Loan Application Status Prediction

A Machine Learning Web App built with Flask

Problem Statement

The loan default dataset has 8 variables and 850 records, each record being loan default status for each customer. Each Applicant was rated as “Defaulted” or “Not-Defaulted”. New applicants for loan application can also be evaluated on these 8 predictor variables and classified as a default or non-default based on predictor variables.

Data Set

bank-loan.csv

Number of Attributes

There are total 8 attributes which are given below :-

Variable Name - Variable Description - Variable Type

  1. Age - Age of each customer - Numerical
  2. Education - Education categories - Categorical
  3. Employment - Employment status - Numerical
  4. Address - Geographic area - Numerical
  5. Income - Gross Income - Numerical
  6. Debtinc - Individual’s debt - Numerical
  7. Creddebt - Debt to credit ratio - Numerical
  8. Othdebt - Any other debts - Numerical

Missing Values

Yes

Tools & Technologies

  1. Python
  2. R
  3. Machine Learning
  4. Flask
  5. Spyder
  6. Jupyter Notebook
  7. RStudio
  8. HTML
  9. CSS

Algorithms

  1. Logistic Regression
  2. Decision Tree
  3. Random Forest

Libraries

  1. os
  2. numpy
  3. pandas
  4. pickle
  5. matplotlib
  6. seaborn
  7. sklearn

Performance Metrics

  1. Accuracy
  2. Recall
  3. Precision
  4. Specificity
  5. F1 Score
  6. AUC - ROC Score
  7. False Positive Rate
  8. False Negative Rate

Visualizations

  1. Bar Graph
  2. Pie Chart
  3. Pair Plot
  4. Box Plot

Table of Contents

  1. Importing libraries
  2. Loading data set
  3. Missing value analysis
  4. Distribution of target variable
  5. Multicollinearity analysis
  6. Outlier Analysis
  7. Missing Values Imputation
  8. Standardization
  9. Model training
  10. Performance metrics
  11. Model selection
  12. Freezing best model

A glimpse of the Web App

welcome-page loan-approval-page loan-rejection-page