{[ promptMessage ]}

Bookmark it

{[ promptMessage ]}

intro - Data Mining CS57300 Purdue University Introduction...

Info iconThis preview shows pages 1–8. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Data Mining CS57300 Purdue University August 24, 2010 Introduction • What is data mining? • Why now? • Data mining process • Example What is data mining? “... the non-trivial extraction of implicit, previously unknown, and potentially useful information from data.” Frawley, Piatetsky-Shapiro, and Matheus (1992) “... a new paradigm that focuses on computerized exploration of large amounts of data and on discovery of relevant and interesting patterns within them.” Feldman and Dagan (1995) What is data mining? Databases Artifcial Intelligence Visualization Statistics Data Mining Also known as: knowledge discovery , exploratory data analysis, applied statistics, machine learning Why now? http://images.forbes.com/images/forbes/2004/1213/113_chart.gif How much information? Lyman and Varian, UCBerkeley (2003) • ~5 exabytes of new information stored in 2002 • 1 Exabyte = 1000 petabytes = 1 mil terabytes = 1 bil gigabytes • The amount of new information stored has about doubled in the last three years • Almost 18 exabytes of information Fowed through electronic channels in 2002 • 98% percent of this total is the information sent and received in telephone calls Data mining process adapted from: U. Fayyad, et al. (1995), “From Knowledge Discovery to Data Mining: An Overview,” Advances in Knowledge Discovery and Data Mining, U. Fayyad et al. (Eds.), AAAI/MIT Press Data Target Data Selection Knowledge Knowledge Preprocessed Data Patterns Data Mining Interpretation/ Evaluation Preprocessing Data mining process...
View Full Document

{[ snackBarMessage ]}

Page1 / 28

intro - Data Mining CS57300 Purdue University Introduction...

This preview shows document pages 1 - 8. Sign up to view the full document.

View Full Document Right Arrow Icon bookmark
Ask a homework question - tutors are online