PySpark

This project in the aim of decreasing the amount of time running the Kmeans algorithm using Big data . Several Optimizations were done .