Home-Credit-Default-Risk

An end-to-end machine learning case study that builds a predictive model for identifying potential loan defaulters, using the dataset provided by Home Credit Group.

Primary language: Jupyter Notebook
License: GNU General Public License v3.0 (GPL-3.0)

The repository contains the following files:

  1. EDA - Home Credit Default.ipynb:
    This notebook contains the in-depth EDA for the given dataset. Note that some of the plots (the Plotly ones) might not render on the GitHub page; they can be viewed by opening the notebook with nbviewer.
  2. Feature Engineering and Modelling.ipynb:
    This notebook contains the detailed feature engineering and modelling on the given dataset.
  3. Final.ipynb:
    This notebook contains the final pipeline, which produces predictions directly from the given inputs and handles all the pre-processing and prediction steps by itself.
  4. Deployment Model Trainnig - 300 Features.ipynb:
    This notebook trains a model on a reduced feature set to lower the computational requirements of the deployed model. This is done keeping in mind the configuration of the AWS EC2 micro instance.
  5. Deployment Folder:
    This folder contains all the files needed for deploying the web app on a remote server. Due to file size limitations, the database is not included in this folder. It can be downloaded from here and placed in the 'Final Pipeline Files' folder inside this folder.
    The deployed model can be tested at: http://ec2-18-222-96-92.us-east-2.compute.amazonaws.com:5000/
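
The reduced feature set used for the deployment model can be illustrated with a small sketch. The README does not say how the 300 features were chosen, so this assumes one common approach: ranking features by an importance score taken from an already fitted model and keeping the top `k`. The function name `select_top_features`, the example feature names, and the scores are all hypothetical and only serve as an illustration.

```python
from typing import Dict, List


def select_top_features(importances: Dict[str, float], k: int = 300) -> List[str]:
    """Return the names of the k features with the highest importance scores."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Sort by score, highest first; break ties by name so the result is deterministic.
    ranked = sorted(importances.items(), key=lambda item: (-item[1], item[0]))
    # Slicing past the end is safe: fewer than k features simply returns all of them.
    return [name for name, _ in ranked[:k]]


# Hypothetical importance scores, for illustration only (not taken from the notebooks).
example_importances = {
    "EXT_SOURCE_MEAN": 0.31,
    "CREDIT_ANNUITY_RATIO": 0.22,
    "DAYS_BIRTH": 0.18,
    "AMT_GOODS_PRICE": 0.09,
    "FLAG_DOCUMENT_3": 0.01,
}

selected = select_top_features(example_importances, k=3)
print(selected)
```

Retraining on only the selected columns shrinks both the pre-processing work and the model input, which is the kind of saving that matters on a small instance such as an EC2 micro.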