This repository contains implementations of K-Means Clustering and Hierarchical Clustering.
The current hierarchical clustering algorithms uses agglomeration and division to create the clusters and uses the following linkages:
- Single Link (MIN)
- Complete Link (MAX)
- Group Average (AVG)
The agglomerative hierarchical clustering script is divided into two classes:
- Agglomerative_Hierarchical
- Proximity_Matrix
- Divisive_Hierarchical
The algorithms were run on a dataset consisting of amino acid sequences. The results are published as dendrograms:
K-Means Clustering
The K-means algorithm currently clusters the sequences into 311 clusters.
Hierarchical clustering:
- Single Link
- Complete Link
- Group Average
- DIANA
- Numpy
- Scipy
- Matplotlib