/apache-spark

Applying big data analytics to College Scorecard dataset using with Apache Spark and PySpark.

Primary LanguageJupyter Notebook

Big Data Analytics with Apache Spark

This repository documents my journey of learning Apache Spark, from installation to programming.

The following technologies are used:

  • Apache Spark
  • PySpark
  • Google Colab
  • Google Drive

References - Technology

References - Data

References - Business