Final products of the investigation:
Medium blog post: https://medium.com/@jose.andres.pacheco/machine-learning-in-high-performance-computing-environments-2ded5bd1618f
Jupyter Notebook: https://github.com/jeosadn/spark_hadoop/blob/master/Spark_Hadoop_Demo.ipynb