/Data-Exploration-using-Apache_spark-RDDs

Exploring Stack-Exchange Data Science dump using Apache Spark

Primary LanguageJupyter Notebook

Data-Exploration-using-Apache_spark-RDDs

Exploring Stack-Exchange Data Science dump using Apache Spark.

  • Reading in the data to Spark RDDs
  • Performing various actions to understand and analyze the data