My solution to Home Credit Default Risk Prediction 2018

I ranked 402nd (TOP 6%) in the Home Credit Default Risk Prediction on Kaggle platform.

In this repository you can find:

  • aggregations with code from neptunml.ipynb - notebook with data processing code (based on public solution)
  • data preparation.ipynb - notebook with my data processing code
  • intersections.ipynb - information on how much all files are interconnected by ID
  • pooling and learning.ipynb - notebook with code of pooling all dataframes and learning my best model, which was used in the final ensemble