/AV_Student_Hunt

2nd place solution for AV student hunt time series forecasting problem statement

Primary LanguagePython

AV Student Hunt: Time Series Forecasting

2nd place solution for AV student hunt time series forecasting problem statement

Packages used:

pandas 0.20.1
numpy 1.12.1
scikit-learn 0.19.0
statsmodel 0.8.0

Instructions:

  1. Folder Data contains train.csv and test.csv
  2. eda_plots.py is used for EDA and identifying outliers
  3. Final Model_Tweedie Regression.py is used for final modelling process in which Tweedie regression is used as a ML model.
  4. Final Submission Tweedie.csv is the final submission of predictions on test dataset.

Resources:

A Note on Tweedie
Tweedie Distribution: Definition and Examples
Tweedie distribution