/pyspark-yelp-data-analysis

A comparative study of the computing efficiency of PySpark architectures versus Python-based distributed and parallel programming approaches, such as MPI, multi-threading, and multi-processing, on the Yelp dataset from Kaggle.
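As a rough illustration of the kind of comparison described above, the following standard-library sketch times the same map-and-reduce aggregation run serially, with threads, and with processes. It is not the repository's code: the synthetic `(business_id, stars)` rows stand in for Yelp reviews, and `make_reviews`, `count_stars`, `split`, and `run` are hypothetical names chosen for this example. PySpark and MPI are left out so the sketch runs without extra dependencies.

```python
import time
from collections import Counter
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def make_reviews(n):
    """Build synthetic (business_id, stars) rows; an assumed stand-in for Yelp data."""
    return [(f"b{i % 7}", (i % 5) + 1) for i in range(n)]


def count_stars(chunk):
    """Map step: count reviews per star rating within one chunk."""
    return Counter(stars for _, stars in chunk)


def split(rows, parts):
    """Split rows into roughly equal contiguous chunks."""
    size = max(1, -(-len(rows) // parts))  # ceiling division
    return [rows[i:i + size] for i in range(0, len(rows), size)]


def run(rows, executor_cls=None, workers=4):
    """Return (star histogram, elapsed seconds) for one execution strategy."""
    start = time.perf_counter()
    chunks = split(rows, workers)
    if executor_cls is None:
        # Serial baseline: process the chunks one after another.
        partials = [count_stars(c) for c in chunks]
    else:
        with executor_cls(max_workers=workers) as pool:
            partials = list(pool.map(count_stars, chunks))
    # Reduce step: merge the per-chunk counters.
    total = sum(partials, Counter())
    return total, time.perf_counter() - start


if __name__ == "__main__":
    rows = make_reviews(200_000)
    strategies = [
        ("serial", None),
        ("threads", ThreadPoolExecutor),
        ("processes", ProcessPoolExecutor),
    ]
    for name, cls in strategies:
        hist, secs = run(rows, cls)
        print(f"{name:10s} {secs:.3f}s {dict(sorted(hist.items()))}")
```

Timings from a sketch like this depend heavily on data size, worker count, and serialization overhead, so they only indicate how such a benchmark might be structured rather than what the study found.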

Primary language: Jupyter Notebook
License: GNU General Public License v3.0 (GPL-3.0)
