SECLUSAL: A SupErvised CLUStering ALgorithm

Primary language: Jupyter Notebook. License: Creative Commons Zero v1.0 Universal (CC0-1.0).

This repository accompanies a blog post on Medium.

The blog post proposes a supervised clustering method that partitions data points into a limited number of clusters with respect to a target variable, based on the features specified by the user.

The resulting clusters have the following characteristics: 

  1. The target variable has low variance within a cluster, but has high variance between clusters, and 
  2. The data points in a cluster share similar values in the features that are relevant to distinguish the target. 
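The first characteristic can be checked numerically by splitting the total variance of the target into a within-cluster part and a between-cluster part. The snippet below is a minimal sketch of that check, not code from the blog post: the function name `variance_decomposition` and the toy `targets` and `labels` are hypothetical, and it assumes cluster labels have already been assigned by some method.

```python
from collections import defaultdict
from statistics import fmean, pvariance


def variance_decomposition(targets, labels):
    """Return (within, between) population variance of targets given cluster labels.

    Hypothetical helper for illustration; the two parts add up to the
    total population variance of the target.
    """
    n = len(targets)
    overall_mean = fmean(targets)

    # Group target values by their cluster label.
    groups = defaultdict(list)
    for y, label in zip(targets, labels):
        groups[label].append(y)

    within = 0.0
    between = 0.0
    for values in groups.values():
        cluster_mean = fmean(values)
        # Spread of the target around its own cluster mean.
        within += sum((y - cluster_mean) ** 2 for y in values)
        # Spread of the cluster mean around the overall mean, weighted by size.
        between += len(values) * (cluster_mean - overall_mean) ** 2

    return within / n, between / n


# Toy data (made up): two clusters with clearly separated target values.
targets = [1.0, 1.2, 0.8, 5.0, 5.1, 4.9]
labels = [0, 0, 0, 1, 1, 1]

within, between = variance_decomposition(targets, labels)
total = pvariance(targets)
print(f"within={within:.4f} between={between:.4f} total={total:.4f}")
```

A clustering that satisfies the first characteristic has a small within-cluster value relative to the between-cluster value, as in this toy example.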

The method is robust to the presence of irrelevant features and correlated features. This supervised clustering method also helps to increase the interpretability of machine learning models.

The contents of this repository