/ml-usingspark

This project demonstrates a complete machine learning pipeline using Python and Apache Spark, covering data preprocessing, exploratory data analysis, model training, and evaluation. It showcases scalable data processing, robust analysis, and diverse modeling approaches for real-world predictive tasks.

Primary language: Jupyter Notebook
