Cement_Strength_Prediction

This repository uses machine learning to address the problem of flawed cement mixtures, which result in poor strength of buildings and structures.

Steps followed:

Performed EDA on the dataset.
Found the correlation between features.
Clustered the dataset into 3 clusters using KMeans.
Trained a model for each cluster and evaluated its performance.
Models performed better when trained on clustered data than when trained on the whole dataset.
XGBoost performed best on both the clustered and the whole dataset, with R2 scores of 0.899001 and 0.929995 respectively.
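
The cluster-then-train steps above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code. It assumes scikit-learn is available, uses a hypothetical synthetic dataset (`make_synthetic_data`) in place of the cement data, and uses `LinearRegression` as a stand-in model. An `xgboost.XGBRegressor` could be swapped in wherever `LinearRegression()` appears. The function names, split ratio, and seeds are all assumptions made for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split


def make_synthetic_data(n_samples=600, seed=0):
    """Hypothetical stand-in for the cement dataset.

    Three well-separated groups of points, each with its own linear
    relationship between the two features and the target.
    """
    rng = np.random.default_rng(seed)
    centers = np.array([[0.0, 0.0], [10.0, 10.0], [20.0, 0.0]])
    slopes = np.array([[3.0, -1.0], [-2.0, 4.0], [0.5, 0.5]])
    group = rng.integers(0, 3, size=n_samples)
    offsets = rng.normal(scale=1.0, size=(n_samples, 2))
    X = centers[group] + offsets
    noise = rng.normal(scale=0.1, size=n_samples)
    y = (offsets * slopes[group]).sum(axis=1) + 5.0 * group + noise
    return X, y


def evaluate(X, y, n_clusters=3, seed=0):
    """Return (R2 of one global model, R2 of per-cluster models)."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed
    )

    # Baseline: a single model trained on the whole training set.
    whole_model = LinearRegression().fit(X_train, y_train)
    r2_whole = r2_score(y_test, whole_model.predict(X_test))

    # Fit KMeans on the training features only, then assign test rows
    # to the nearest learned cluster centre.
    kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    train_clusters = kmeans.fit_predict(X_train)
    test_clusters = kmeans.predict(X_test)

    # One model per cluster, trained only on that cluster's rows.
    models = {}
    for c in range(n_clusters):
        mask = train_clusters == c
        models[c] = LinearRegression().fit(X_train[mask], y_train[mask])

    # Route each test row to the model of its cluster.
    y_pred = np.empty_like(y_test)
    for c, model in models.items():
        mask = test_clusters == c
        if mask.any():
            y_pred[mask] = model.predict(X_test[mask])
    r2_clustered = r2_score(y_test, y_pred)

    return r2_whole, r2_clustered


if __name__ == "__main__":
    r2_whole, r2_clustered = evaluate(*make_synthetic_data())
    print(f"whole dataset R2: {r2_whole:.4f}")
    print(f"clustered R2:     {r2_clustered:.4f}")
```

Fitting KMeans on the training split only, and reusing it to assign test rows, keeps information from the test set out of the clustering step. On real data the gain from clustering depends on how distinct the clusters are, so the synthetic result here should not be read as representative of the cement dataset.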

The orange curve represents the models trained on clustered data. XGBoost and Linear Regression performed better.
