/NYC_TaxiTripDuration_MS

Visualizations and EDA for NYC Taxi Trip Duration Kaggle competition

Primary LanguageJupyter NotebookMIT LicenseMIT

NYC Taxi Trip Duration - Kaggle competition (2017-09-15)

This repository contains experiments I made for the Kaggle New York City Taxi Trip Duration competition.

plot preview

See the notebook on kaggle.com.

Getting started

Clone the repository:

git clone https://github.com/martinsotir/NYC_TaxiTripDuration_MS/blob/master/LICENSE
cd ./NYC_TaxiTripDuration_MS

Download Kaggle NYC 2016 taxi trip duration data:

  • Put train.zip and test.zip in the ./data directory.

  • Extract zip files:

    unzip ./data/test.zip -d ./data/
    unzip ./data/train.zip -d ./data/

To run notebooks, I recommend using a python virtualenv with conda :

conda env create -f environment.yml
source activate nyc-ms
jupyter notebook