Deep Learning Tutorials on Apache Spark using BigDL

Step-by-step deep learning tutorials on Apache Spark using BigDL. The tutorials are inspired by the Apache Spark examples, the Theano tutorials, and the TensorFlow tutorials.

Topics

  1. RDD
  2. DataFrame
  3. Spark SQL
  4. Structured Streaming
  5. Forward and backward
  6. Linear Regression
  7. Introduction to MNIST
  8. Logistic Regression
  9. Feedforward Neural Network
  10. Convolutional Neural Network
  11. Recurrent Neural Network
  12. LSTM
  13. Bi-directional RNN
  14. Auto-encoder
  15. Visualizing Learning

Environment

Start Jupyter Server

  • Run pip install BigDL==0.7.0
  • Run jupyter notebook --notebook-dir=./ --ip=0.0.0.0 --no-browser
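
Before opening the notebooks, it can help to confirm that the installed packages are visible to the Python interpreter that Jupyter will use. The following is a minimal sketch, not part of the tutorials: the helper name missing_modules is hypothetical, and the module names "bigdl" and "notebook" are assumed to be what the BigDL and Jupyter Notebook packages expose.

```python
import importlib.util


def missing_modules(names):
    """Return the names from `names` that this interpreter cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed top-level module names; adjust if your installation differs.
    required = ["bigdl", "notebook"]
    missing = missing_modules(required)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

If a module is reported as missing, the pip install step most likely ran under a different Python environment than the one running this check.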

Start Toree Kernel to Run Scala Notebooks

  • Run pip install BigDL==0.7.0
  • Run pip install https://dist.apache.org/repos/dist/release/incubator/toree/0.2.0-incubating/toree-pip/toree-0.2.0.tar.gz
  • Run ./toree_install.sh
  • Run jupyter notebook --notebook-dir=./ --ip=0.0.0.0 --no-browser

Run Demo

  • Open a browser (Chrome, Firefox, or Safari is recommended).
  • Go to the notebook client at http://localhost:8888, open the example .ipynb files, and run them.
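
If the page does not load, one quick check is whether anything is listening on the notebook port at all. The sketch below assumes the server is running locally on port 8888, as in the address above; the function name notebook_server_reachable is hypothetical. It only tests that a TCP connection can be opened, not that the listener is actually Jupyter.

```python
import socket


def notebook_server_reachable(host="localhost", port=8888, timeout=2.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    if notebook_server_reachable():
        print("Something is listening on localhost:8888.")
    else:
        print("Nothing reachable on localhost:8888; is the Jupyter server running?")
```

When the server runs on a remote machine (the --ip=0.0.0.0 option makes it listen on all interfaces), pass that machine's hostname instead of localhost.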