WittenFrankChapter4Part4Prism (2)

# WittenFrankChapter4Part4Prism (2) - Data Mining Algorithms...

This preview shows pages 1–11. Sign up to view the full content.

Data Mining – Algorithms: Prism – Learning Rules via Separating and Covering Chapter 4, Section 4.4

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Rules Can be directly read off a decision tree – but those might not be the most compact or effective rules Common approach – take each class in turn and find a way of “covering” all instances in it, while excluding instances not in the class
Let’s use My Weather Data Again Again, Let’s take this a little more realistic than book does Divide into training and test data Let’s save the last record as a test (using my weather, nominal … and assuming we’re working on the play?=yes class first … We’re looking for a rule in the form if ___ Then play? = yes Possible ways of filling include: Outlook = sunny Outlook = overcast Temperature = hot

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Find the best filler using training data We look at proportion of instances that match the left hand side that also match the right hand side LHS Matches LHS Of those, Match RHS Ratio Outlook = sunny 5 4 .80 Outlook = Overcast 4 2 .50 Outlook = Rainy 4 0 .00 Temp = Hot 4 1 .25 Temp = Mild 5 3 .60 Temp = Cool 4 2 .50 Humid = High 6 3 .50 Humid = Normal 7 3 .43 Windy = TRUE 5 4 .80 Windy = False 8 2 .25
Refining Rule If this rule is not accurate enough for us (based on a threshold), we’re going to try to refine it by adding a clause(s) Now, we’re looking to fill in a clause in the following: if outlook = sunny and _____ then play? = yes We consider the accuracy of all possible ways of filling this blank …

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Find the best filler using training data We look at proportion of instances that match the left hand side that also match the right hand side LHS Matches LHS Of those, Match RHS Ratio Outlook = Sunny & Temp = Hot 2 1 .50 Outlook = Sunny & Temp = Mild 2 2 1.00 Outlook = Sunny & Temp = Cool 1 1 1.00 Outlook = Sunny & Humid = High 3 2 .67 Outlook = Sunny & Humid = Normal 2 2 1.00 Outlook = Sunny & Windy = TRUE 2 2 1.00 Outlook = Sunny & Windy = False 3 2 .67
Still more to cover though This rule only covers 2 of the 6 play=yes days This approach looks more for pockets of a success whereas ID3 is looking more at the big picture So we temporarily toss those 2 instances and work on another rule

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Example: My Weather (Nominal) Outlook Temp Humid Windy Play? sunny hot high FALSE no sunny hot high TRUE yes overcast hot high FALSE no rainy mild high FALSE no rainy cool normal FALSE no rainy cool normal TRUE no overcast cool normal TRUE yes sunny cool normal FALSE yes rainy mild normal FALSE no overcast mild high TRUE yes overcast hot normal FALSE no rainy mild high TRUE no TEST
We’re Looking for another rule … in the form if ___ Then play? = yes Again, possible ways of filling include: Outlook = sunny Outlook = overcast Temperature = hot However, our data is a little different now

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Find the best filler using training data We look at proportion of instances that match the left hand side that also match the right hand side LHS Matches LHS Of those, Match RHS Ratio Outlook = sunny 3 2 .67 Outlook = Overcast 4 2 .50 Outlook = Rainy 4 0 .00
This is the end of the preview. Sign up to access the rest of the document.

{[ snackBarMessage ]}

### What students are saying

• As a current student on this bumpy collegiate pathway, I stumbled upon Course Hero, where I can find study resources for nearly all my courses, get online help from tutors 24/7, and even share my old projects, papers, and lecture notes with other students.

Kiran Temple University Fox School of Business ‘17, Course Hero Intern

• I cannot even describe how much Course Hero helped me this summer. It’s truly become something I can always rely on and help me. In the end, I was not only able to survive summer classes, but I was able to thrive thanks to Course Hero.

Dana University of Pennsylvania ‘17, Course Hero Intern

• The ability to access any university’s resources through Course Hero proved invaluable in my case. I was behind on Tulane coursework and actually used UCLA’s materials to help me move forward and get everything together on time.

Jill Tulane University ‘16, Course Hero Intern