/Strata2017

Primary LanguageJupyter Notebook

# Hands on Lab - Data Science Experience

Description: Follow this lab and perform hands on exercises with IBM Data Science Expereince. Use a public data set of car accidents in united states and perform basic corrolations. Data Science Experience allows you to easily develop and share a model that scales using Apache Spark, Python or RStudio.

Instructions:

Step 1. Get on IBM Data Science Experience (DSX).

Create an account.

  1. Go to http://datascience.ibm.com/

  2. Click the signup button on the top right

  1. Enter your email and press continue. Note: If you have a Bluemix account you can click continue with Bluemix credentials, otherwise click create your Bluemix account and enter your email.
  1. You should get an email from "ibmacct" with your IBMid Confirmation code
  1. Then, on the next page fill in the corresponding fields and click CREATE ACCOUNT
  1. In the new page, write your email and click CONTINUE
  1. Write your recently generated password and click on SIGN IN
  1. It will take a minute to create your account. When ready, click on Get Started.

You are now in the Data Science Experience landing page. Your environment is automatically set up with one Apache Spark instance and 5 GB of object storage. From here you can explore any of the tutorials, videos, sample notebooks, totorials or articles in the community.

Step 2. Create a project

  1. Click on the left hand side "hamburger" icon and then click on My Projects to see a list of your projects. You should only see a default project.
  1. Click on the create project icon on the top right of the project list.
  1. Type a name for your project. For instance, "DSX Lab". A Spark service and an object storage will be automatically selected as well as a container with a default name. A container is a directory on the object storage. Click on Create.

You are now in your new project where you can create notebooks and data assets as well as add collaborators.

Step 3. Download Repository.

If you are at IBM Booth, then you can skip this part as the files are located on your desktop.

  1. Download Repository from github https://github.com/IBMDataScience/WOW2016 to local directory.
  1. Unzip repo zip files locally.

Step 4. Create a new notebook.

  1. From the your project page, on the "Overview" tab click "add notebook"

  2. Click “Add Notebook”

  1. Enter a notebook Name, Description and pick from File, click browse and pick VehicleAccidentLab.ipynb file downloaded.
  1. Click on Create Notebook

Step 5. Follow Instructions in Notebook.