NYC TAXI GREEN DATA PREPROCESSING
Requirements to run the notebook
- This project requires Python 3, scikit-learn, pandas, jupyter-notebook
Quickstart
- Clone the repo
git clone https://github.com/chauvansang/1MG_DE.git && cd 1MG_DE
- Start up jupyter using docker
docker run --rm -p 8888:8888 -e JUPYTER_ENABLE_LAB=yes -v "$PWD":/home/jovyan/work jupyter/scipy-notebook
http://127.0.0.1:8888/?token=xxxxx provided after running the docker command
Then access jupyter at the link
- Open a terminal in the jupyter lab and run
pip install pyarrow geopy
- Go to the work folder and run the nyc_data.ipynb file