Instructions:

#Step 1. Get on IBM Data Science Experience (DSX).

Create an account.

Go to http://datascience.ibm.com/
Click the signup button on the top right

If you have a Bluemix account you can click continue with Bluemix credentials, otherwise click create your Bluemix account and enter your email.
You should get an email from "ibmacct" with your IBMid Confirmation code

Then, on the next page fill in the corresponding fields and click CREATE ACCOUNT

In the new page, write your email and click CONTINUE

Write your recently generated password and click on SIGN IN

It will take a minute to create your account. When ready, click on Get Started.

You are now in the Data Science Experience landing page. Your environment is automatically set up with one Apache Spark instance and 5 GB of object storage. From here you can explore any of the tutorials, videos, sample notebooks, totorials or articles in the community.

Step 2. Create a project

Click on the left hand side "hamburger" icon and then click on My Projects to see a list of your projects. You should only see a default project.

Click on the create project icon on the top right of the project list.

Type a name for your project. For instance, "DSX Lab". A Spark service and an object storage will be automatically selected as well as a container with a default name. A container is a directory on the object storage. Click on Create.

You are now in your new project where you can create notebooks and data assets as well as add collaborators.

Step 3. Get the data into DSX

Click here to download this repository to your computer to access the data stored in the data directory.
Unzip this zip file on your computer so you have a directory with all the assets in the repository. We will be using the data from the data directory. The screenshot below shows dragging the contents to the desktop for easy access:

Go to your recently created project on DSX and click on the add data assets + icon

Click on the Add file and select the transactions.csv file from your computer and click on open

Once the file is loaded, click on Apply to add this file to your project.

Click Apply on the pop-up:

You should see transactions.csv under the data assets list of your project. Your data is now loaded in your object storage in the container associated to your project. If your project name is "DSX Lab", the default container name is DSXLab (unless you change to a different name on Step 2, part 3).

Step 4. Importing Notebooks for Machine Learning Lab

From the your project page, on the "Overview" tab click "add notebook"

In the next screen named “Create Notebook”, switch to “From File” tab, name the notebook “ML Lab Installation”, and choose the notebook file on your disk from the archive: notebooks/ml-lab-installation.ipynb; alternatively you can switch to “From URL” tab and use the following “Notebook URL”:

https://raw.githubusercontent.com/IBMDataScience/wow-lab-to-production/master/notebooks/ml-lab-installation.ipynb

Click Create Notebook at the bottom of the page to add the notebook
Run all the cells in the notebook clicking on the Run All option under Cells

Once the libraries have been installed, all the cells will have a number present on the left side of the notebook between square brackets.
Click File -> Save or the Floppy disk icon to save the notebook
Return back to the project overview page by clicking on "DSX Lab" or the name you gave your project

NOTE: the software packages installation may take a few minutes, but it needs to be done only once per account

Load the second notebook “Machine Learning with DSX - Lab” (from the file machine-learning-with-DSX-lab.ipynb, or from URL https://raw.githubusercontent.com/IBMDataScience/wow-lab-to-production/master/notebooks/machine-learning-with-DSX-lab.ipynb ) by following the same steps 1-3 as above

Step 5. Adding data from Object Storage in the Notebook

From the loaded notebook “Machine Learning with DSX Lab” click on "Find and add data":

2. The expanded "Find and add data" would show transaction.csv under “Files” section

Follow the instructions in the cell of the notebook shown below:

4. After inserting the code, at then end you will see something that looks like this:

 ```R
 df.data.1 <-  read.csv(file = getObjectStorageFileWithCredentials_92c679820c6ebdd53("DSXLab", "transactions.csv"))
 head(df.data.1)    
 ```

Replace df.data.1 with df

 ```R
 df <-  read.csv(file = getObjectStorageFileWithCredentials_92c679820c6ebdd53("DSXLab", "transactions.csv"))
 head(df)    
 ```

Check point:

After the modifications, the section code should define a data frame variable df which is used in the notebook; the modifications should be done only for replacing the variable in the last 2 lines of code shown above.

Step 6. Generate a decision tree model with visualizations in R

Begin execution of every code section in the order in which the sections appear by clicking on the button or by using the menu Cell> Run Cells. The lab covers the following actions:

a. Declaring the libraries used in the lab

b. Loading the data from the Object Storage into a data frame

c. Transforming the data for using with C5.0

d. Training the classification model (C5.0)

e. Transforming the classification model to visualize it in Brunel

f. Using a tree map for visualizing and exploring the decision tree in Brunel

g. Using a tree for exploring the decision tree in Brunel

h. Showing the native R visualization of the decision tree for comparison
Stop the kernel (File > Stop Kernel) and go back to the project overview page or DSX home page

Additional Activity

Create and Visualize SPSS CHAID Decision Tree in Scala

Load the this notebook “SPSS Decision Tree and Visualization” (from the file machine-learning-with-DSX-lab.ipynb, or from URL https://raw.githubusercontent.com/IBMDataScience/wow-lab-to-production/master/notebooks/SPSS%2BDecision%2BTree%2Band%2BVisualization.ipynb ) by following the same steps 1-3 as above
This notebook also uses the transactions.csv data set.
Follow the instructions in the cell of the notebook shown below:

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
data		data
images		images
notebooks		notebooks
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contents

Instructions:

Create an account.

Step 2. Create a project

Step 3. Get the data into DSX

Step 4. Importing Notebooks for Machine Learning Lab

Step 5. Adding data from Object Storage in the Notebook

Check point:

Step 6. Generate a decision tree model with visualizations in R

Additional Activity

Create and Visualize SPSS CHAID Decision Tree in Scala

Help us make DSX great! Go here to take a 5 question survey about using Data Science Experience for this lab.

End of Lab

About

Releases

Packages

Languages

IBMDataScience/wow-lab-to-production

Folders and files

Latest commit

History

Repository files navigation

Contents

Instructions:

Create an account.

Step 2. Create a project

Step 3. Get the data into DSX

Step 4. Importing Notebooks for Machine Learning Lab

Step 5. Adding data from Object Storage in the Notebook

Check point:

Step 6. Generate a decision tree model with visualizations in R

Additional Activity

Create and Visualize SPSS CHAID Decision Tree in Scala

Help us make DSX great! Go here to take a 5 question survey about using Data Science Experience for this lab.

End of Lab

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages