coderjolly/pyspark-yelp-data-analysis
A comparative study of the computing efficiency of PySpark architectures versus Python-based distributed programming methodologies, such as MPI, multi-threading, and multi-processing, on the Yelp Kaggle dataset.
Jupyter Notebook · GPL-3.0