/UndErNsembled

Reducing imbalanced dataset (Undersampling) by Consensus Clustering (Simple Majority Voting function) and validating the changes using different classifier model with bagging and boosting techniques.

Primary LanguageJupyter Notebook

UndErNsembled:

About the model:

In this project, we reduced an imbalanced dataset (Undersampling) by Consensus Clustering using 'Simple Majority Voting' consensus function and further saw the increase in the accuracy of disease prediction by running multiple classifers with bagging and boosting technique.

Dataset:

The dataset we have is the colon cancer dataset of (62x2000) dimension.

Result:

This is the final result, i.e. comparison of different classifiers of predicting the disease accurately in both balanced and imbalanced data.