sarmstr5/cs657_mining_massive_datasets
Homework assignments for CS657 (Mining Massive Datasets). Assignments use Spark and Hadoop through the Python API and cover word count, association rule mining, linear regression, and recommender systems.
Jupyter Notebook · GPL-3.0