Commit
step-by-step instructions for viz in blog
Showing 1 changed file with 43 additions and 2 deletions.
@@ -1,2 +1,43 @@
# visualize-data-fast
Describe how to visualize data fast using the Data Refinery tool in Watson Studio
# Visualize Data Fast | Watson Studio.
## Click [here](https://medium.com/@jorge_castanon/visualize-data-fast-watson-studio-ae1ec63e9b8f?source=friends_link&sk=c9fc9c85f364bdc221a93c1ae78c26db) for blog post.

## Step-by-step instructions to reproduce the visualizations:

1. Download the 1,000-row sample data set from [here](https://ibm.box.com/s/6fltz5ilap8pbwzu2tt1yxil6ldosc9d). The file's name is `thermostat_rebates_by_zip_1000.csv`.

1. Create an account on Watson Studio [cloud](https://www.ibm.com/cloud/watson-studio) or download the desktop version [here](https://www.ibm.com/products/watson-studio-desktop).

1. Open Watson Studio.

1. Click `New project` on the top right to create a new project on Watson Studio.

1. Name your project and click `Create` on the bottom right.

1. Click the `Assets` tab if you are not already there.

1. Upload `thermostat_rebates_by_zip_1000.csv`: on the right-hand side of the screen, drop the file or browse for it.

1. In your project, under `Data assets`, click the data set to see a preview of it.

1. Click the blue `Refine` box in the top right to open the data set with the Data Refinery tool.

1. Once the Data Refinery tool is open, navigate to the `Visualizations` tab.

1. Create the histogram:
    1. Select `Histogram chart` under CHART TYPES.
    1. Select the column "value" (thermostat rebates in USD) as the `X-axis`.
    1. Deselect `Show kde curve` and `Show distribution curve`, and set `Bin width` to 4.

1. Create the map:
    1. Select column "lng" as the Longitude field and column "lat" as the Latitude field.
    1. Select column "value" (thermostat rebates in USD) as the Size map field.
    1. Zoom in to the interesting areas of the map.

1. Create the scatterplot with correlations:
    1. Select column "value".
    1. Click `Add another column` and select column "median".
    1. Click `Add another column` and select column "mean".
    1. Click `Add another column` and select column "population".
    1. The only strong correlation is between "median" and "mean", which is not surprising (the mean and median household income are similar statistics).
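
For readers who want to sanity-check the histogram binning and the pairwise correlations outside Data Refinery, here is a minimal Python sketch that uses only the standard library. It is not how Data Refinery computes its charts. The inline sample rows are made up for illustration and are not taken from the real data set, the helper names (`load_columns`, `histogram_counts`, `pearson`, `correlation_pairs`) are hypothetical, and the bins are assumed to start at multiples of the bin width.

```python
import csv
import io
import math
from collections import Counter
from itertools import combinations

# Hypothetical rows for illustration only; NOT taken from the real data set.
# The column names match the ones used in the steps above.
SAMPLE_CSV = """value,median,mean,population
3,40000,52000,1200
5,45000,58000,800
9,50000,61000,1500
10,62000,75000,400
14,70000,83000,2300
"""


def load_columns(text, names):
    """Parse CSV text and return {column name: list of floats}."""
    rows = list(csv.DictReader(io.StringIO(text)))
    return {name: [float(row[name]) for row in rows] for name in names}


def histogram_counts(values, bin_width):
    """Count values per bin; each key is the left edge of [edge, edge + bin_width)."""
    # Assumption: bins are aligned to multiples of bin_width.
    counts = Counter(math.floor(v / bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def correlation_pairs(columns):
    """Return {(column a, column b): Pearson r} for every pair of columns."""
    return {
        (a, b): pearson(columns[a], columns[b])
        for a, b in combinations(columns, 2)
    }


columns = load_columns(SAMPLE_CSV, ["value", "median", "mean", "population"])
bins = histogram_counts(columns["value"], bin_width=4)
correlations = correlation_pairs(columns)

print(bins)
for pair, r in correlations.items():
    print(pair, round(r, 2))
```

To try this on the downloaded file, replace `SAMPLE_CSV` with the text of `thermostat_rebates_by_zip_1000.csv`, assuming the file has a header row with these column names.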