intro - Data Mining CS57300 Purdue University August 24,...

Info iconThis preview shows pages 1–8. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Data Mining CS57300 Purdue University August 24, 2010 Introduction What is data mining? Why now? Data mining process Example What is data mining? ... the non-trivial extraction of implicit, previously unknown, and potentially useful information from data. Frawley, Piatetsky-Shapiro, and Matheus (1992) ... a new paradigm that focuses on computerized exploration of large amounts of data and on discovery of relevant and interesting patterns within them. Feldman and Dagan (1995) What is data mining? Databases Artifcial Intelligence Visualization Statistics Data Mining Also known as: knowledge discovery , exploratory data analysis, applied statistics, machine learning Why now? http://images.forbes.com/images/forbes/2004/1213/113_chart.gif How much information? Lyman and Varian, UCBerkeley (2003) ~5 exabytes of new information stored in 2002 1 Exabyte = 1000 petabytes = 1 mil terabytes = 1 bil gigabytes The amount of new information stored has about doubled in the last three years Almost 18 exabytes of information Fowed through electronic channels in 2002 98% percent of this total is the information sent and received in telephone calls Data mining process adapted from: U. Fayyad, et al. (1995), From Knowledge Discovery to Data Mining: An Overview, Advances in Knowledge Discovery and Data Mining, U. Fayyad et al. (Eds.), AAAI/MIT Press Data Target Data Selection Knowledge Knowledge Preprocessed Data Patterns Data Mining Interpretation/ Evaluation Preprocessing Data mining process...
View Full Document

This note was uploaded on 03/13/2012 for the course CS 573 taught by Professor Staff during the Fall '08 term at Purdue University-West Lafayette.

Page1 / 28

intro - Data Mining CS57300 Purdue University August 24,...

This preview shows document pages 1 - 8. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online