Overview of this Lecture / week

This week we are going to look at the typical life-cycle used for most Data Science/Data Mining etc projects. This is called CRISP-DM. It has been around a long time and many people and consultancy companies have adopted it. Some have created their own versions of it that. They all do and say the same thing.

The lab work will involve you setting up an account on the SAS Cloud service called SAS OnDemand. You can then connect to my class group, download the SAS Enterprise Miner (EM) app, log-in using this app and create a project. You can use this project for all your lab and assignment work.

The SAS Enterprise Miner is hosted by SAS on their OnDemand Service. DIT has no control or administration access of this software. If you encounter any issues with your account, you will need to contact SAS Support.

Because SAS Enterprise Miner is hosted outside of DIT you will be able to use the software at home, work, in DIT, etc. So you can complete the lab exercises anywhere.

Notes

Click here to download notes for Week 2 notes.

L2 - DM Life Cycles

Videos of Notes

Related Videos

Lab Exercises

Task 1

You will need a network account to be able to log into the computers in the lab.

Or you can use your own laptop (see Task 2 from last week)

Task 2

The SAS Enterprise Miner is hosted by SAS on their OnDemand Service. This is hosted external to DIT. It is a service the SAS provides for Universities and Companies around the World.

DIT has no control or administration access of this software.

Because SAS Enterprise Miner is hosted outside of DIT you will be able to use the software at home, work, in DIT, etc. So you can complete the lab exercises anywhere.

You need to sign up and register to use SAS OnDemand for this class group. You need to have completed all the sign up steps before next weeks class.

If the above link does not work, then copy and pate the following into your internet browser.

After you have successfully created your account, follow these steps:

Sign on the the Control Center at https://odamid.oda.sas.com.

Look for the Enroll in a course link in the “Enrollments” section near the bottom of the page. Click this link to start the enrollment.

Enter the course code: d6e01577-f458-497c-9543-e428dbcd4212

Submit the form.

Confirm that this is the correct course and then click the button to finish enrolling.

SAS On Demand Course Name : MSc DIT Data Mining

SAS Course code : d6e01577-f458-497c-9543-e428dbcd4212

Video – Demo of signing up to SAS OnDemand

Follow the SAS OnDemand Student Registration Handbook on how to register for a class.

It can take anything from 1 minute to 24hours for your account to become active.

If you have any issues with setting up your SAS OnDemand Account or using SAS Enterprise Miner then you will beed to contact SAS Support

Task 3

Download the SAS Enterprise Miner Java application.

Log into the SAS OnDemand Control Center.

Click on the Enterprise Miner link. This will download a small Java application to your machine.

 

 

 

 

 

You should save this file (to your desktop), as you can use just use this app from now on.

You do not need to download this file every time.

Task 4

Open the SAS Enterprise Miner application, login and create a project.
Create a SAS EM Project for your Lab work. Call it My Lab Work.

When you have created the project you are ready to start using SAS Enterprise Miner.

That’s all the lab work for this week.

What to prepare for next week

Make sure you have completed all the above steps.

Maybe go through them again and have the project set up for next weeks class.

Additional Reading Materials

Crisp-DM

Fayyad Paper on KDD

Introduction to Data Mining – Book Chapter

Data Mining in Business – Book Chapter

Data Mining A Closer Look – Book Chapter

CRISP-DM Guide

CRISP-DM Process Model

When Algorithms Control the World