Sample programs using Apache Spark for distributed computing on a cluster.
degrees-of-separation.py
uses Breadth-First-Search to find the degrees of separation between superheroes in the Marvel universesimilar-movies.py
uses Item-Based-Collaborative-Filtering and Cosine Similarity to recommend similar movies based on movie ratings