Friend-recommendation-engine

Developed a friend recommendation engine using the Stackoverflow dataset.

This project involved following tasks.

  1. Load Graph datasets
  2. Data Wrangling using Spark Dataframes.
  3. Analytics on data:
    -> Most active users
    -> Most ignored users
    -> Most helpful pairs
  4. Distributed breadth first search for finding friends for users based on their interactions with community.
  5. Visualization of friends network.

Libraries used: pySpark, networkx (for visualization)