lec4-clustering

lec4-clustering - Distributed Computing Seminar Lecture 4:...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon
Distributed Computing Seminar Lecture 4: Clustering – an Overview and Sample MapReduce Implementation Summer 2007 Except as otherwise noted, the content of this presentation is © 2007 Google Inc. and licensed under the Creative Commons Attribution 2.5 License.
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Outline Clustering Intuition Clustering Algorithms The Distance Measure Hierarchical vs. Partitional K-Means Clustering Complexity Canopy Clustering MapReducing a large data set with K-Means and Canopy Clustering
Background image of page 2
Clustering What is clustering?
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Google News They didn’t pick all 3,400,217 related articles by hand… Or Amazon.com Or Netflix…
Background image of page 4
Other less glamorous things. .. Hospital Records Scientific Imaging Related genes, related stars, related sequences Market Research Segmenting markets, product positioning Social Network Analysis Data mining Image segmentation…
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
The Distance Measure How the similarity of two elements in a set
Background image of page 6
Image of page 7
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 53

lec4-clustering - Distributed Computing Seminar Lecture 4:...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online