This preview shows page 1. Sign up to view the full content.
Unformatted text preview: Andreas Hotho, Andreas Nürnberger, and Gerhard Paaß A Brief Survey of Text Mining The enormous amount of information stored in unstructured texts cannot simply be used for further processing by computers, which typically
handle text as simple sequences of character strings. Therefore, speciﬁc
(pre-)processing methods and algorithms are required in order to extract
useful patterns. Text mining refers generally to the process of extracting
interesting information and knowledge from unstructured text. In this article, we discuss text mining as a young and interdisciplinary ﬁeld in the
intersection of the related areas information retrieval, machine learning,
statistics, computational linguistics and especially data mining. We describe
the main analysis tasks preprocessing, classiﬁcation, clustering, information
extraction and visualization. In addition, we brieﬂy discuss a number of
successful applications of text mining. 1 Introduction As computer networks become the backbones of science and economy enormous quantities of machine readable documents becom...
View Full Document
This note was uploaded on 06/19/2011 for the course IT 2258 taught by Professor Aymenali during the Summer '11 term at Abu Dhabi University.
- Summer '11