In this context the expression potentially useful


...ed models which represent the found connections. In this context, the expression “potentially useful” means that the samples to be found for an application generate a benefit for the user. Thus the definition couples knowledge discovery with a specific application.

Knowledge discovery in databases is a process defined by several processing steps that have to be applied to a data set of interest in order to extract useful patterns. These steps have to be performed iteratively, and several of them usually require interactive feedback from a user. As defined by the CRoss Industry Standard Process for Data Mining (CRISP-DM) model (crispdm and CRISP99 1999), the main steps are: (1) business understanding, (2) data understanding, (3) data preparation, (4) modelling, (5) evaluation, and (6) deployment (cf. fig. 1). Besides the initial problem of analyzing and understanding the overall task (the first two steps), one of the most time-consuming steps is data preparation. This is of particular interest for text mining, which needs special preprocessing methods to convert textual data into a format ...
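The iterative character of the six steps listed above can be illustrated with a small sketch. It is only an illustration under stated assumptions: the names `Phase`, `run_process` and `feedback` are hypothetical and are not part of the CRISP-DM model, and the rule that feedback may only send the process back to an earlier step is a simplification chosen for this example.

```python
from enum import IntEnum
from typing import Callable, List, Optional


class Phase(IntEnum):
    """The six main steps named in the text, in their listed order."""
    BUSINESS_UNDERSTANDING = 1
    DATA_UNDERSTANDING = 2
    DATA_PREPARATION = 3
    MODELLING = 4
    EVALUATION = 5
    DEPLOYMENT = 6


def run_process(
    feedback: Callable[[Phase], Optional[Phase]],
    max_steps: int = 50,
) -> List[Phase]:
    """Walk through the phases in order and return the visited sequence.

    `feedback(phase)` stands in for interactive user feedback (an assumption
    of this sketch): it returns None to continue with the next phase, or an
    earlier phase that should be revisited. `max_steps` guards against
    feedback that never lets the process finish.
    """
    visited: List[Phase] = []
    phase = Phase.BUSINESS_UNDERSTANDING
    for _ in range(max_steps):
        visited.append(phase)
        target = feedback(phase)
        if target is not None and target < phase:
            phase = target              # step back to an earlier phase
        elif phase is Phase.DEPLOYMENT:
            break                       # last phase reached, no rework requested
        else:
            phase = Phase(phase + 1)    # move on to the next phase
    return visited


def make_feedback() -> Callable[[Phase], Optional[Phase]]:
    """Hypothetical user: rejects the first evaluation once, then accepts."""
    state = {"rejected": False}

    def feedback(phase: Phase) -> Optional[Phase]:
        if phase is Phase.EVALUATION and not state["rejected"]:
            state["rejected"] = True
            return Phase.DATA_PREPARATION
        return None

    return feedback


trace = run_process(make_feedback())
print([p.name for p in trace])
```

In this run the hypothetical user sends the process back from evaluation to data preparation once, so steps (3) to (5) appear twice in the trace before deployment is reached.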