# cs345-streams3-2 - Still More Stream-Mining Frequent...

## Still More Stream-Mining
• Frequent Itemsets
• Elephants and Troops
• Exponentially Decaying Windows

## Counting Items
• Problem: given a stream, which items appear more than s times in the window?
• Possible solution: think of the stream of baskets as one binary stream per item.
    • 1 = item present; 0 = not present.
    • Use DGIM to estimate counts of 1's for all items.
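
The per-item idea above can be sketched in Python. This is a minimal illustration, not the lecture's code: the class name `DGIM`, the helper `estimate_item_counts`, and the choice of at most two buckets per size are assumptions made here. Each item gets its own binary stream, and a DGIM summary estimates how many 1's fall in the last `window` baskets.

```python
class DGIM:
    """Approximate count of 1's among the last `window` bits of a binary stream."""

    def __init__(self, window):
        self.window = window
        self.time = 0
        # Buckets are (timestamp of the bucket's newest 1, size), newest first.
        # Sizes are powers of two and never decrease toward the old end.
        self.buckets = []

    def add(self, bit):
        self.time += 1
        # Drop the oldest bucket once its newest 1 has left the window.
        while self.buckets and self.buckets[-1][0] <= self.time - self.window:
            self.buckets.pop()
        if not bit:
            return
        self.buckets.insert(0, (self.time, 1))
        # Whenever three buckets share a size, merge the two oldest of them
        # into one bucket of double size; the merge may cascade upward.
        i = 0
        while i + 2 < len(self.buckets) and self.buckets[i][1] == self.buckets[i + 2][1]:
            merged = (self.buckets[i + 1][0], 2 * self.buckets[i + 1][1])
            self.buckets[i + 1:i + 3] = [merged]
            i += 1

    def estimate(self):
        # All buckets count in full except the oldest, which counts half
        # (a size-1 oldest bucket is known to lie inside the window).
        if not self.buckets:
            return 0
        total = sum(size for _, size in self.buckets)
        return total - self.buckets[-1][1] // 2


def estimate_item_counts(baskets, items, window):
    """Feed one binary stream per item (1 = present in basket) into its own DGIM."""
    counters = {item: DGIM(window) for item in items}
    for basket in baskets:
        for item, counter in counters.items():
            counter.add(1 if item in basket else 0)
    return {item: counter.estimate() for item, counter in counters.items()}
```

Items whose estimate exceeds s would then be reported as frequent. Because the oldest bucket is only counted by half, each estimate can differ from the true count by up to 50%, which is the "only approximate" caveat attached to this approach.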
## Extensions
• In principle, you could count frequent pairs or even larger sets the same way.
    • One stream per itemset.
• Drawbacks:
    1. Only approximate.
    2. Number of itemsets is way too big.
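
To get a feel for the second drawback, the number of candidate itemsets of size k over n items is the binomial coefficient C(n, k). The short sketch below assumes a hypothetical catalog of one million distinct items; that figure is chosen here for illustration and does not come from the slides.

```python
from math import comb


def num_itemsets(n_items, k):
    """Number of distinct itemsets of size k drawn from n_items items."""
    return comb(n_items, k)


# Hypothetical catalog size, chosen only for illustration.
N_ITEMS = 1_000_000

for k in (1, 2, 3):
    print(k, num_itemsets(N_ITEMS, k))
```

With these assumed numbers there are 499,999,500,000 pairs alone, so keeping one binary stream (and one summary) per itemset is out of reach well before triples are considered.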

## Approaches
1. "Elephants and troops": a heuristic way to converge on unusually strongly connected itemsets.
2. Exponentially decaying windows: a heuristic for selecting likely frequent itemsets.
## Elephants and Troops
• When Sergey Brin wasn't worrying about Google, he tried the following experiment.
• Goal: find unusually correlated sets of words.
    • "High correlation" = frequency of occurrence of set >> product of frequency of members.
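
The correlation criterion above can be made concrete with a small sketch. The function names `frequency` and `correlation_ratio` are assumptions introduced here, and each document is treated as a set of words. The ratio compares how often the whole set occurs with what independence of its members would predict; the slides do not say how large the ratio must be to count as ">>", so no threshold is built in.

```python
from math import prod


def frequency(baskets, itemset):
    """Fraction of baskets that contain every member of `itemset`."""
    itemset = set(itemset)
    return sum(1 for basket in baskets if itemset <= set(basket)) / len(baskets)


def correlation_ratio(baskets, itemset):
    """Observed frequency of the set divided by the product of member frequencies.

    A ratio near 1 is what independence predicts; a ratio far above 1 is
    the "high correlation" case.
    """
    expected = prod(frequency(baskets, [item]) for item in itemset)
    if expected == 0:
        return 0.0  # some member never occurs, so the set never occurs either
    return frequency(baskets, itemset) / expected
```

For example, with `docs = [{"a", "b"}, {"a", "b"}, {"c"}, {"d"}]`, each of "a" and "b" appears in half the documents and always together, so `correlation_ratio(docs, ["a", "b"])` is 0.5 / 0.25 = 2.0.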

## Experimental Setup
• The data was an early Google crawl of the Stanford Web.