Basic exploration environment to engage with the Retail Rocket Kaggle dataset. Based on Jupyter, Spark, Postgres and Docker.
- Docker
- docker-compose
- Download the set from https://www.kaggle.com/retailrocket/ecommerce-dataset/version/4# and unzip into postgres/dataset
- run docker-compose up
- get the jupyter url from the console and open it in your favourite web browser