nsurampu/Clustering-Systems

A repo of clustering systems

Python

Clustering Systems

This repository contains implementations of K-Means Clustering and Hierarchical Clustering.

K-Means

Hierarchical Clustering

The current hierarchical clustering algorithms uses agglomeration and division to create the clusters and uses the following linkages:

Single Link (MIN)
Complete Link (MAX)
Group Average (AVG)

The agglomerative hierarchical clustering script is divided into two classes:

Agglomerative_Hierarchical
Proximity_Matrix
Divisive_Hierarchical

Results

The algorithms were run on a dataset consisting of amino acid sequences. The results are published as dendrograms:

K-Means Clustering

The K-means algorithm currently clusters the sequences into 311 clusters.

Hierarchical clustering:

Single Link

Complete Link

Group Average

DIANA

Libraries Used

Numpy
Scipy
Matplotlib

Authors

![Aditya Srikanth]
![Prateek Das Gupta]