- Project Overview
- Highlight
- Data Source
- Data Preprocessing Tools and Techniques
- Exploratory Data Analysis
- Model Evaluation
- Insights
- Considerations
Investors seek accurate cryptocurrency price predictions to maximize profits in this volatile market. This project involved predicting cryptocurrency prices using machine learning models. The goal was to forecast prices across different time horizons to aid investment decisions.
- Data collection using Binance API
- Exploratory data analysis
- Feature engineering
- Modeling with LSTM, XGBoost, Random Forest, and evaluation
- GUI development with Streamlit
Historical daily price data from Binance API for the top 20 cryptocurrencies from 2017-2023. The dataset included various market data variables such as open, high, low, close prices, volume, and more.
- Python
- Pandas, Matplotlib, Seaborn for EDA
- Sklearn for preprocessing
- Statistical Analysis
- Time series analysis
- Correlation Analysis
- Feature Engineering
- Normalization
- Decision TreeRegressor, XGBoost, Random Forest for predictive modeling
- Streamlit for Graphical User Interface
Data was examined for patterns, trends, and outliers using statistical analysis and visualizations
Performance metrics, including Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), Root Mean Squared Error (RMSE), and R-squared (R2), were calculated for model evaluation.
- Most coins exhibited positive skewness in their price distribution. Positive skewness indicates that the majority of cryptocurrencies tend to have prices skewed towards higher values, this suggests potential for sharp upward movements in prices. This indicates investors may benefit from long positions.
- High volatility observed in prices for all cryptocurrencies indicates frequent and significant price fluctuations in the cryptocurrency market. This can present opportunities for traders but also increased risks. The need for risk management strategies is required to minimize the downside.
- There is a significant positive correlation found between Bitcoin and other major coins like Ethereum. Investors in altcoins should be aware that the performance of these coins can be influenced by Bitcoin's price. Diversifying across different coins may not always provide complete independence from Bitcoin's price trends
- Feature importance analysis revealed that open, high and low prices were top predictors
- Short-term predictions were most accurate, with decreasing accuracy for longer horizons
- Random Forest outperformed Decision Tree and XGBoost, achieving the best R2 score. The model provided reliable short and medium-term predictions.
- Model selection and hyperparameter tuning required time and experimentation.
- Feature engineering involved trial and error.
- Balancing model complexity and generalization to avoid overfitting or underfitting required careful consideration..