problems challenges on Kaggle
Let's start solve This is presentation/training prepared for DataKRK.
I'll be happy if it help you start solve challenges on Kaggle
How to start use it?
git clone https://github.com/slon1024/lets-start-solve-problem-on-kaggle
cd lets-start-solve-problem-on-kaggle
git fetch
Run ipython (jupyter) notebook (inside directory lets-start-solve-problem-on-kaggle
):
ipython notebook
.
Note: you need install ipython
before (and some other needed packages) or just install anaconda.
After this you will have all three steps (in different branches, initial step is on the master
).
- Step 1 is on the branch
step1
(git checkout step1
) - Step 2 is on the branch
step2
(git checkout step2
) - Step 3 is on the branch
step3
(git checkout step3
)
Note: solution is inside this file bike-sharing-demand.ipynb. After switched a branch you should refresh your notebook in a browser.
train.csv
file?
Where is You should download this from Kaggle site (Bike Sharing Demand - data).
Quick review scripts
Note: github allowing you do a qucik review (of course readonly). For bigger scripts this works very slowwwwly.
- Step 0 (initial step)
- Step 1 (improve evaluation a model)
- Step 2 (understand better data and improve a model)
- Step 3 (feature engineering and tuning model)
I recommend to you follow step by step.
If you have any questions or ideas, please let me know.