Lect28-Associat

Lect28-Associat - DATA MINING Susan Holmes Stats202 Lecture...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon
. . . . . . DATA MINING Susan Holmes © Stats202 Lecture 28 Fall 2010 ABabcdfghiejkl
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
. . . . . . Special Announcements I Do not update your version of R before the end of the quarter. I All requests should be sent to stats202-aut1011-staff@lists.stanford.edu . I Homework is due today, (no R, but practice questions for the Fnal). I Kaggle competition is cooking along, site: http://inclass.kaggle.com/stat202 and 12 teams competing. I This week is the last active week, with ofFce hours and response to email, deadweek there is no email and no ofFce hours, please plan accordingly.
Background image of page 2
. . . . . . Last times Cluster Evaluation. Probabilistic Clustering. EM algorithm. Graph sparsifcation. Missing Data. Clustering in R.
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
. . . . . . Mining Association Data Looking for patterns in categorical data, shopping lists, etc. .. Categorical data in sets, each observation could be I One shopping expedition. I One customer proFle. I Computer transactions. I A microarray of particular tissue or patient. I A Document, email or blog. Based on dependence of categorical variables. If buying onions is independent of buying burgers there is no point in exploring the association because we would just have: P ( buy burgers and onions ) = P ( buy burgers ) × P ( buy onions )
Background image of page 4
. . . . . . The data can be visualized as a table where the rows are the transactions and asymmetric binary variables are the columns corresponding to the different items . ID beer diapers bread milk butter soda tuna 1 1 1 1 1 0 0 0 2 0 1 1 1 0 0 0 3 0 1 0 0 0 1 0 4 1 0 1 1 0 0 1 5 0 0 1 0 1 0 1 6 0 1 1 1 1 0 1 Here the items are I = { beer , diapers , bread , milk , butter , soda , tuna }
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
. . . . . . ASSOCIATION RULES A rule is defned as an implication oF the Form X Y where X , Y I and X Y = . The sets oF items (For short itemsets) X and Y are called antecedent (leFt-hand-side or LHS) and consequent (right-hand-side or RHS) oF the rule respectively. The support
Background image of page 6
Image of page 7
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 18

Lect28-Associat - DATA MINING Susan Holmes Stats202 Lecture...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online