This repository contains a PySpark data analysis projects focused on exploring and analyzing various datasets using PySpark's DataFrame API. The project demonstrates the use of PySpark for big data processing, data exploration, transformation, and aggregation tasks. It includes real-world datasets and Jupyter notebooks showcasing the analysis and insights derived from the data.
asvivs/PySpark-Data-Analysis-Project
This repository contains a PySpark data analysis projects focused on exploring and analyzing various datasets using PySpark's DataFrame API.
Jupyter Notebook