A repository to work in as I learn Apache Spark and Scala, starting with this Udemy course.
In this directory, ml-100k
is a symbolic link to
../../spark_scala_data/ml-100k
, which is expected to contain the contents of
the ml-100k.zip
archive downloaded from the MovieLens Dataset
archive.
Similarly, nba-shot-logs
is a symbolic link to
../../spark_scala_data/nba-shot-logs
, which is expected to contain the file
shot_logs.csv
downloaded from the Kaggle NBA (2015) Shot Logs
dataset.