This preview shows pages 1–2. Sign up to view the full content.
This preview has intentionally blurred sections. Sign up to view the full version.View Full Document
Unformatted text preview: The total entropy will be 0.2000 (693/3204) + 1.8408 (1562/3204) + 1.6964 (949/3204) = 1.4431. Q3. (2 points) When the squared error criterion is used, outlier can unduly influence the clusters that are found. In particular, when outliers are present, the resulting cluster centroids may not be as representative as they otherwise would be and thus, the SSE will be higher as well. But for density-based clustering method, such as DBSCAN, the outliers will be labeled as noise points and then eliminated, so they will not influence the clustering results....
View Full Document
This note was uploaded on 03/29/2011 for the course SEEM 463 taught by Professor Hongcheng during the Spring '11 term at CUHK.
- Spring '11