/Cryptocurrencies

Machine learning with elbow curves and K-Means model

Primary LanguageJupyter Notebook

Cryptocurrencies

Project Summary

This project was conducted in Jupyter Notebook with Python mlenv. This project included 4 parts. The first part was preparing the data. Null values were found and dropped. All values were converted to floats and certain columns were dropped. To create two dataframes.

In the second part of the project, the data dimensions were reduced by PCA. A new dataframe pcs_df was created with 3 compononents the data was reduced into by PCA. The third part included clustering the cryptocurrencies using K-Means. This included plotting an elbow curve to determine the best value of k.

image

Looking at the plot above it appears k should be 4. The K-Means was run with 4 clusters as shown below.

image

Then two dataframes were concatenated to set up for the fourth part of the project.

image

Part 4 included plotting a 3-D scatterplot, hvplot table, scaling the data, and a 2-D scatter plot. Images below.

image

image

image

image

Overall the scatterplot was concentrated in the corner with TotalCoinsMined(X) less than 0.2 and TotalCoinsSupply(Y) less than 0.4. There are two notable outliers. Otherwise data seems to be consistent.