Distributed-Computing-with-Big-Data-PageRank-

This project uses PySpark to implement a distributed algorithm that computes the PageRank of each web page in a large web graph. Below, “page” or “web page” means a node in the web graph.
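As a rough illustration of the computation involved, here is a minimal single-machine sketch of the PageRank iteration in plain Python. It is not the project's notebook code. The function name `pagerank_sketch`, the damping factor of 0.85, the fixed iteration count, and the choice to spread the rank of pages without outgoing links evenly over all pages are assumptions made for this example. The two phases in the loop correspond to the steps a PySpark version would typically express with RDD operations such as `flatMap` and `reduceByKey`.

```python
from collections import defaultdict


def pagerank_sketch(edges, damping=0.85, iterations=20):
    """Compute PageRank for a graph given as (source, target) pairs.

    Hypothetical helper for illustration; all defaults are assumptions.
    """
    # Build the adjacency list and collect every page that appears.
    links = defaultdict(list)
    pages = set()
    for src, dst in edges:
        links[src].append(dst)
        pages.update((src, dst))

    n = len(pages)
    ranks = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # "Map" phase: each page sends rank / out-degree to every page it links to.
        contribs = defaultdict(float)
        dangling = 0.0
        for page in pages:
            out = links.get(page, [])
            if out:
                share = ranks[page] / len(out)
                for dst in out:
                    contribs[dst] += share
            else:
                # Assumption: rank of pages with no outgoing links is shared by all pages.
                dangling += ranks[page]

        # "Reduce" phase: combine the contributions into the new rank of each page.
        ranks = {
            page: (1.0 - damping) / n
            + damping * (contribs[page] + dangling / n)
            for page in pages
        }
    return ranks


if __name__ == "__main__":
    toy_graph = [("a", "b"), ("b", "c"), ("c", "a"), ("d", "a")]
    for page, rank in sorted(pagerank_sketch(toy_graph).items()):
        print(page, round(rank, 4))
```

With this treatment of dangling pages the ranks sum to 1 after every iteration, which makes a convenient sanity check when porting the same logic to a distributed setting.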

Primary language: Jupyter Notebook
