Source material for Data Science for Telecom Tutorial at Strata Singapore 2015
The source notebook demonstrates building a churn prediction model using Spark and Spark MlLib's pipeline API for cross validation and model tuning. The Pipeline API is available in PySpark in version 1.6 or higher.
The source ipython notebook can be displayed as slides. If you have ipython installed you can serve the slides locally with:
ipython nbconvert ds-for-telco.ipynb --to slides --post serve