/spark

Primary LanguageJupyter Notebook

spark

this repository is to include all my project in spark
1- using spark to predict death ---> using preprocessing (removing nulls, feature engineering , one hot encoding and logistic regression all in a pipe line )

2- preprocess, sql and ml spark (the scattered version with some processing using sql )