/visualize-data-fast

Describe how to visualize data fast using the Data Refinery tool in Watson Studio

Visualize Data Fast | Watson Studio: Blog

Step by step instructions to reproduce viz in blog

  1. Download the 1,000-row sample data set from here. The file's name is thermostat_rebates_by_zip_1000.csv.

  2. Create an account on Watson Studio cloud or download the desktop version here.

  3. Open Watson Studio.

  4. Click New project on the top right to create a new project on Watson Studio.

  5. Name your project and click Create on the bottom right.

  6. Click the Assets tab if you are not already there.

  7. Upload the thermostat_rebates_by_zip_1000.csv, on the right hand side of the screen drop or browse the file.

  8. In your project, under Data assets, click the data set to see a preview of the data set.

  9. Click the Refine blue box in the top right to open the data set with the Data Refinery tool.

  10. Once the Data Refinery tool is open, navigate to the Visualizations tab.

  11. Create the histogram:

    1. Select the Histogram chart on the CHART TYPES.
    2. Select the column "value" (thermostat rebates in USD) as the X-axis.
    3. Un-select the Show kde curve and the Show distribution curve and choose Bin width equal to 4.
  12. Create the map:

    1. Select the Map chart on the CHART TYPES.
    2. Select column "lng" as the Longitude field and column "lat" as the Latitude field.
    3. Select column "value" (thermostat rebates in USD) as the Size map field.
    4. Zoom-in to the interesting areas of the map.
  13. Create the scatterplot woth correlations:

    1. Select the Scatterplot chart on the CHART TYPES.
    2. Select column "value".
    3. Click Add another column and select column "median".
    4. Click Add another column and select column "mean".
    5. Click Add another column and select column "population".
    6. Only strong correlation is between "median" and "mean" which is not surprising (the mean and median household income are similar statistics).