/Clustering-Algorithms-Spark

KMeans, Cure and Canpoy algorithms are demonstrated using Pyspark.

Primary LanguageJupyter Notebook

Clustering-Algorithms-Spark

The different clustering algorithms demonstrated using pyspark in this repository are:

  1. KMeans
  2. Cure
  3. Canpoy

The codes can be found in the respective directories along with the datasets.

These programs were written as a part of Big Data Analytics Lab.