This repo contains the source code and support materials for my talk Apache Spark 101.
In order to run the example you must first install pyspark
:
python3 -m venv venv
source venv/bin/activate
pip install pyspark
The setup.sh
file is also provided for convenience.
If you have any questions, please reach out to me at daniela.petruzalek@gmail.com. I'm also on Twitter as @danicat83.