This preview has intentionally blurred sections. Sign up to view the full version.View Full Document
Unformatted text preview: The total entropy will be 0.2000 × (693/3204) + 1.8408 × (1562/3204) + 1.6964 × (949/3204) = 1.4431. Q3. (2 points) When the squared error criterion is used, outlier can unduly influence the clusters that are found. In particular, when outliers are present, the resulting cluster centroids may not be as representative as they otherwise would be and thus, the SSE will be higher as well. But for density-based clustering method, such as DBSCAN, the outliers will be labeled as noise points and then eliminated, so they will not influence the clustering results....
View Full Document
- Spring '11
- representative, DBSCAN