A tutorial on getting started with Spark 2.1 using PySpark, Spark DataFrames, Spark SQL, ML Pipelines, and more.
The code and walk-through are contained in nba_spark.ipynb.
Linear regression plot:
Querying and analyzing geospatial shot chart data:
Points per shot plot:
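As a rough illustration of the metric, here is a minimal sketch in plain Python rather than Spark, so it runs without a cluster. It assumes points per shot means total points scored divided by shot attempts, grouped by shot zone. The function name, the zone labels, and the sample data are all hypothetical and are not taken from nba_spark.ipynb.

```python
from collections import defaultdict


def points_per_shot(shots):
    """Return {zone: total points / attempts} for (zone, points) pairs.

    Assumption: every pair is one shot attempt, and a miss scores 0 points.
    """
    totals = defaultdict(lambda: [0, 0])  # zone -> [points, attempts]
    for zone, points in shots:
        totals[zone][0] += points
        totals[zone][1] += 1
    return {zone: pts / attempts for zone, (pts, attempts) in totals.items()}


# Hypothetical sample data: one tuple per shot attempt.
sample_shots = [
    ("corner 3", 3), ("corner 3", 0), ("corner 3", 0), ("corner 3", 3),
    ("mid-range", 2), ("mid-range", 0), ("mid-range", 0), ("mid-range", 0),
]

pps = points_per_shot(sample_shots)
for zone, value in sorted(pps.items()):
    print(f"{zone}: {value:.2f} points per shot")
```

In Spark the same aggregation would typically be a group-by on the zone column followed by a sum and a count, but the exact columns used in the notebook are not shown here.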