#DATA ANALYSIS :: Kaggle's Chicago crime analysis [BIG DATA]
#Copyrights : JOUHRI ANASS & BOUBEKEUR TAHAR
#BUSINESS GOAL : - Interprete the dataset by providing : + Data cleaning & reading + Data query & visualisation + Predictive model (to be defined ..)
#TOOLS : - Project based on : + Apache PySpark + Pandas + Matplotlib & Seaborn + Numpy + IpyLeaflet + Spark Machine Learning Lib
- Executed using :
+ Pyspark (on Jupyter notebook)
#Instructions : - To execute pyspark on Jupyter notebook : export PYSPARK_DRIVER_PYTHON=jupyter export PYSPARK_DRIVER_PYTHON_OPTS='notebook'
- Follow detailed instructions on :
https://blog.sicara.com/get-started-pyspark-jupyter-guide-tutorial-ae2fe84f594f