This repository is not active
samipsinghal/ml-usingspark
This project demonstrates a complete machine learning pipeline using Python and Apache Spark, covering data preprocessing, exploratory data analysis, model training, and evaluation. It showcases scalable data processing, robust analysis, and diverse modeling approaches for real-world predictive tasks.
Jupyter Notebook