/Spark-with-Scala-Project

Using the features of Spark with programming power of Scala to crunch large data-sets for data analysis.

Primary LanguageScala

Spark-with-Scala-Project

Using Spark's immutable RDDs and diverse functions of Scala to crunch a large dataset for data analysis. Also to deploy a Spark Standalone Cluster and use the parallelization feature in Spark to run the Monte-Carlo algorithm for two hundred thousand simulations in order to reach an accurate approximation of Pi.

For detailed instructions on the project, please watch the following videos:-

  1. Data Analysis using Spark with Scala transformation functions: https://www.youtube.com/watch?v=jeIHdxD_a54&t=35s
  2. Spark Standalone Cluster Deployment with Scala Program: https://www.youtube.com/watch?v=nx_v721rc9A&t=79s