FAQ : Data Mining

FAQ for Data Mining module

This page will contain Questions I’ve received about the module, the topics, labs, and assignment.

As we don’t have face-to-face classes, it can be challenging for students to ask questions. If we had typical class and lab time, students would be able to ask questions in an informal way. This helps them to get a better understanding of what we cover during the semester.

This FAQ attempts to address our new challenges in an online world where we don’t have those dynamic interactions.

Students can contact me with their questions. I will attempt to respond promptly with answers. To assist with knowledge sharing with the whole class, I will post the questions I receive on this page, along with the answers. [The names of the student asking the question will not be listed]

IMPORTANT: When I say, I will attempt to respond promptly, it means I will endeavor to respond within a day or two of getting the question. If I don’t respond as quickly as you would like, just remember I have other classes, task and roles to perform each day. If I haven’t responded with three days, then Yes Please get onto me and Gently remind me ūüôā

If you see a week with No questions, that means no one asked me a question.

IMPORTANT: It is important that students check this page regularly for new Q&A. There will be no notices posted to the class group when new Q&A are added.

IMPORTANT: Make sure you fill in the Class Sign-in google doc each week. You can take the class live or at a later time offline using the recorded videos. Just let me know on the Sign-in doc what you did.  The links to the sign-in doc are in the Brightspace module.

Week 0 – Course Admin

 

Week 1 – Introduction

Q: Where is the link for the SAS Software?

A: The links are given in week 2 notes/webpage. No need to worry about those now. But do check out the Java requirements for running SAS Enterprise Miner. That’s the only thing you really need to do this week.

Q: Do we have to use the our TU Dublin email address when registering with SAS?

A: You can use any email address you want.

Q: Do we need to install R and Python and why?

A: You will probably be installing R and Python as part of your other modules. There will be some lab exercises in a couple of weeks time on using R and Python to perform pattern discovery. I’ll give some sample code for you to use, and you can then use this on other data set.

Q: Can we use Teams instead of using Bongo?

A: Yes we can and we will use Teams from Week 2.  I will add a link in the Brightspace module to the Teams meeting. There will be separate ones for the Part-Time class (TU60) and for the Full-Time class (TU59).

IMPORTANT: Make sure you fill in the Class Sign-in google doc each week. You can take the class live or at a later time offline using the recorded videos. Just let me know on the Sign-in doc what you did.  The links to the sign-in doc are in the Brightspace module.

Week 2 – Life Cycle and Tasks

Q: Is this weeks class on Bongo or Teams?

A: From this week we will be using Teams for the online classes. It has better screen sharing and audio.  The links to Teams will be on the Brightspace module.

Q: How long does it take to receive the emails from SAS?¬† or¬† I haven’t received the emails from SAS?

Emails should arrive within a couple of minutes. If that doesn’t happen, check your Spam/Junk folder.¬† You might find the emails there.

Q: When I try to run the SAS Enterprise Miner App I get a JRE issue?

A: If you encounter this problem them you might have skipped or not completed the lab work from Week 1. Here is the link to the webpage. Scroll down the page until you get to the section about checking the version of Java you have installed. SAS give a recommended version, but if you are running a higher version of JRE, then that should be OK to use. You don’t have to down grade the version.

Q: When I run the SAS Enterprise Miner App the font size is very small?

A: This problem seems to only affect people using certain types of computers. There are a couple of possible solutions. I’ve a webpage with solutions students have used to solve this problem. There are two solutions given. The first one is the most recently solution.

Q: Do I need to download the SAS Enterprise Miner App each time I want to run it?

A: What’s the answer to every question in IT?¬† = It Depends!.¬† This is a difficult question to answer as it seems to be slightly different for each student.¬† The simplest answer is, Yes download it each time. By doing this you will get the latest version of the App with all updates, bug fixes, enhancements, etc.

Q: I run Linux on my computer. The App doesn’t seem to run or I get an error. How can I get this to work on Linux?

A: The following instructions have been provided by a student in the class. They are using Linux and had OpenJDK installed. The SAS Enterprise Miner App wouldn’t run/work and the following outlines the steps they followed to get SAS Enterprise Miner to work on their Linux machine.

– You need to change to Oracle’s Java rather than OpenJDK for the SAS Enterprise Miner.
steps:
After you setup the profile, stick in the course code and follow all the steps to get the main.jlnp
They ask you for an Oracle account to download ?
Download jdk-8u261-linux-x64.tar.gz

sudo mkdir -p /usr/lib/jvm

cd to folder where you downloaded jdk-8u261-linux-x64.tar.gz
sudo tar zxvf jdk-8u261-linux-x64.tar.gz -C /usr/lib/jvm
(you could set this Oracle Java as your default java as this stage, but I didn’t bother, mine is still OpenJDK)
cd to the directory with your main.jnlp and run:
/usr/lib/jvm/jdk1.8.0_261/bin/javaws main.jnlp

Week 3 – Data Preparation

 

Week 4 – Data Preparation

 

Week 5 – Association Rule Mining

 

Week 6 – Complete Lab Exercises

 

Week 7 – Classification

 

Week 8 – Classification

 

Week 9 – Clustering

 

Week 10 – Deployment of Data Science and Machine Learning

 

Week 11 – GDPR for Data Science & Ethics for Data Science and Machine Learning

 

Assignment

Q: Can we use RapidMiner or some other language/tool to process the data or for doing the modelling?

A: RapidMiner is not one of the languages or tools we have used in this module. Use what we have used during the semester and/or what is listed on the assignment handout

Q: For joint/group assignment work, we were wondering whether the assignment should be submitted by one of us on behalf of both of us or whether we each write up the report from our findings separately?

A: Only one submission is necessary. Make sure to clearly state the people who worked on the report.

Exam