Spark ML Zeppelin
-
Install Python 2.7.x (The one that comes with MacOS/Linux is probably fine).
-
Install and initialize nltk:
-
Install library using pip:
pip install nltk
-
Download nltk data
import nltk nltk.download()
(Details: http://www.nltk.org/data.html)
-
Download and install Zeppelin 0.7.0:
-
Download from https://zeppelin.apache.org/download.html
-
Unzip the downloaded file.
-
Run:
(zeppelin home)/bin/zeppelin-daemon.sh start
-
Clone this project.
-
Configure Zeppelin to use this project's
notebook
directory: -
Open the notebook repos page: http://localhost:8080/#/notebookRepos
-
Change the "Notebook Path" to
.../sparkml-zeppelin/notebook
-
Load the Zeppelin note: http://localhost:8080/#/notebook/2CBEJDES5