Start Pyspark Jupyter Env

Built with docker-compose==1.29.2

https://docs.docker.com/compose/gettingstarted/

Use start_pysparkjupyter.sh to start the environment

Add-ons

  • You can configure Kerberos in krb5.conf in case you're working with a kerberized cluster

Docker commands

docker-compose -f pysparkjupyterenv.yml down && docker-compose  -f pysparkjupyterenv.yml build
docker-compose -f pysparkjupyterenv.yml up -d
docker exec -u 0 -it pysparkjupyter_container service ssh start