/db-spark-projects

EPFL Database Systems Project 2

Primary LanguageJupyter Notebook

EPFL Database Systems class - Project2

This repo contains the solution of the Database System class @ EPFL, developed by me and @manuleo.

We used Scala and Spark (using RDDs only for the first 2 exercises, as Spark SQL and Spark DataFrame were not allowed). This project can be divided in 3 tasks:

We have also written unit tests for the latter 2 tasks.

Finally, we have also written a report about the results we obtained.

Our project has been graded 6/6.