This preview shows page 1. Sign up to view the full content.
Unformatted text preview: ng approaches, but we had to leave out many other. Obviously there are many
ways to deﬁne clusters and because of this we cannot expect to obtain something like the ‘true’ clustering. Still clustering can be insightful. In contrast to
classiﬁcation, which relies on a prespeciﬁed grouping, cluster procedures label
documents in a new way. By studying the words and phrases that characterize
a cluster, for example, a company could learn new insights about its customers
and their typical properties. A comparison of some clustering methods is given
in Steinbach et al. (2000).
The Utility of Clustering 3.3 Information Extraction Natural language text contains much information that is not directly suitable
for automatic analysis by a computer. However, computers can be used to
sift through large amounts of text and extract useful information from single
words, phrases or passages. Therefore information extraction can be regarded as
a restricted form of full natural language understanding,...
View Full Document
- Summer '11