#DATA ANALYSIS :: Kaggle's Chicago crime analysis [BIG DATA]

#Copyrights : JOUHRI ANASS & BOUBEKEUR TAHAR

#BUSINESS GOAL : - Interprete the dataset by providing : + Data cleaning & reading + Data query & visualisation + Predictive model (to be defined ..)

#TOOLS : - Project based on : + Apache PySpark + Pandas + Matplotlib & Seaborn + Numpy + IpyLeaflet + Spark Machine Learning Lib

- Executed using :
	+ Pyspark (on Jupyter notebook)

#Instructions : - To execute pyspark on Jupyter notebook : export PYSPARK_DRIVER_PYTHON=jupyter export PYSPARK_DRIVER_PYTHON_OPTS='notebook'

- Follow detailed instructions on : 
	https://blog.sicara.com/get-started-pyspark-jupyter-guide-tutorial-ae2fe84f594f