/cs657_mining_massive_datasets

Homework assignments for CS657, Mining Massive Datasets. The assignments are written for Spark and Hadoop using the Python API and cover word count, association rule mining, linear regression, and recommender systems.

Primary language: Jupyter Notebook
License: GNU General Public License v3.0 (GPL-3.0)
