INFS4203 / INFS7203 Data Mining 2008
Copyright © 2008. Gabriel Fung. All Rights Reserved.

Solution for Assignment 3

Question 1

Let: A = Apple, B = Banana, C = Carrot, D = Detergent, E = Egg, F = Fish

Transaction  Items
1            F A D B
2            D A C E B
3            C A B E
4            B A D

An itemset is frequent if it appears in more than 60% of the 4 transactions, that is, in at least 3 of them.

Step 1: Count 1-Item Frequency:

Item  Freq
A     4 (>60%)
B     4 (>60%)
C     2
D     3 (>60%)
E     2
F     1

Step 2: Generate Candidate:

Item  Freq
A     4
B     4
D     3

Step 3: Generate 2-Item Frequency:

Item  Freq
AB    4
BD    3
AD    3

A and B both appear in all four transactions, so AB has frequency 4.

Step 4: Generate Candidate:

Item  Freq
AB    4
BD    3
AD    3

Step 5: Generate 3-Item Frequency:

Item  Freq
ABD   3

Rule Confidence:

The confidence of a rule X -> Y is Freq(X and Y) / Freq(X).

Rule      Confidence
A -> B    100%
B -> A    100%
A -> D    75%
D -> A    100%
B -> D    75%
D -> B    100%
A -> BD   75%
BD -> A   100%
B -> AD   75%
AD -> B   100%
D -> AB   100%
AB -> D   75%

Conclusion: Return all of the above rules with conf > 80%, that is, the rules with 100% confidence.

Solution 2

Step 1: Ordered Frequency for the Items:

Item  Freq
A     4
B     4
D     3
C     2
E     2
F     1

Step 2: Create the tree step by step

After transaction 1 (ordered: A B D F):

Root
  A:1
    B:1
      D:1
        F:1

After transaction 2 (ordered: A B D C E):

Root
  A:2
    B:2
      D:2
        F:1
        C:1
          E:1

After transaction 3 (ordered: A B C E):

Root
  A:3
    B:3
      D:2
        F:1
        C:1
          E:1
      C:1
        E:1

After transaction 4 (ordered: A B D):

Root
  A:4
    B:4
      D:3
        F:1
        C:1
          E:1
      C:1
        E:1

...
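
The level-wise procedure in Question 1 (count, keep the frequent itemsets, join them into larger candidates, then score the rules) can be sketched in Python. This is a minimal illustration rather than the course's reference implementation: the names count, apriori and rules are hypothetical, the 60% support threshold is assumed from the ">60%" annotations, and the 80% confidence threshold is taken from the conclusion.

```python
from itertools import combinations

# Transactions from Question 1 (A = Apple, ..., F = Fish).
transactions = [
    {"F", "A", "D", "B"},
    {"D", "A", "C", "E", "B"},
    {"C", "A", "B", "E"},
    {"B", "A", "D"},
]
MIN_SUPPORT = 0.6      # assumption: read off the ">60%" annotations
MIN_CONFIDENCE = 0.8   # from the conclusion ("conf > 80%")


def count(itemset):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)


def apriori():
    """Return {itemset: frequency} for all frequent itemsets, level by level."""
    n = len(transactions)
    level = [frozenset([i]) for i in sorted(set().union(*transactions))]
    frequent = {}
    k = 1
    while level:
        counts = {c: count(c) for c in level}
        kept = {c: f for c, f in counts.items() if f / n > MIN_SUPPORT}
        frequent.update(kept)
        k += 1
        # Join: unions of two frequent (k-1)-itemsets that have exactly k items.
        candidates = {a | b for a in kept for b in kept if len(a | b) == k}
        # Prune: every (k-1)-subset of a candidate must itself be frequent.
        level = [c for c in candidates
                 if all(frozenset(s) in kept for s in combinations(c, k - 1))]
    return frequent


def rules(frequent):
    """Yield (lhs, rhs, confidence) for every split of each frequent itemset."""
    for itemset, freq in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                yield lhs, itemset - lhs, freq / frequent[lhs]


frequent = apriori()
all_rules = list(rules(frequent))
strong = [(l, r, c) for l, r, c in all_rules if c > MIN_CONFIDENCE]

for lhs, rhs, conf in strong:
    print("".join(sorted(lhs)), "->", "".join(sorted(rhs)), f"{conf:.0%}")
```

Looking up frequent[lhs] inside rules is safe because every subset of a frequent itemset is itself frequent, so its count was stored at an earlier level.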

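
The tree construction in Solution 2 can be sketched the same way: each transaction is reordered by the item ranking from Step 1 and inserted along a shared prefix path, incrementing the count of every node it passes. This is a minimal sketch with hypothetical names (Node, build_fp_tree, render). It keeps every item, including the infrequent ones, because the solution's tree does, and it breaks frequency ties in the order listed in Step 1.

```python
class Node:
    """One FP-tree node: an item label, a count, and children keyed by item."""

    def __init__(self, item=None):
        self.item = item
        self.count = 0
        self.children = {}


def build_fp_tree(transactions, order):
    """Insert each transaction, sorted by the given item order, under one root."""
    rank = {item: i for i, item in enumerate(order)}
    root = Node()
    for t in transactions:
        node = root
        for item in sorted(t, key=lambda x: rank[x]):
            if item not in node.children:
                node.children[item] = Node(item)
            node = node.children[item]
            node.count += 1  # shared prefixes accumulate counts
    return root


def render(node, depth=0):
    """Return the tree below node as indented 'item:count' lines."""
    lines = []
    for child in node.children.values():
        lines.append("  " * depth + f"{child.item}:{child.count}")
        lines.extend(render(child, depth + 1))
    return lines


transactions = [
    ["F", "A", "D", "B"],
    ["D", "A", "C", "E", "B"],
    ["C", "A", "B", "E"],
    ["B", "A", "D"],
]
order = ["A", "B", "D", "C", "E", "F"]  # ordering from Step 1 of Solution 2

tree = build_fp_tree(transactions, order)
print("\n".join(render(tree)))
```

Calling build_fp_tree on the first one, two or three transactions only gives the intermediate trees of the step-by-step construction.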