class_09_03 - Statistical Data Mining ORIE 474 Fall 2007...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon
Statistical Data Mining ORIE 474 Fall 2007 Tatiyana V. Apanasovich 09/03/07 Visualizing Data
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
3. Visualizing and Exploring Data Summarizing Data Tools for Displaying Single variables Relationships between two variables More than two variables Principal Component Analysis
Background image of page 2
3.2 Summarizing Data Suppose that x(1),…,x(n) is a set of n data values Relevant sample statistics are: Location measures: Mean Median and Quartiles Mode Dispersion or variability measures: Standard deviation and variance Interquartile range and range Skewness
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
3.2 Summarizing Data (cont’d) Ex: 100 data points sampled from a normal distribution with
Background image of page 4
Sample Mean Sample Mean: Ex: (whereas µ=0) Location measure Sample mean is the value that is “central” in the sense that it minimizes the sum of squared differences between it and the data: Proof: = i i x n ) ( 1 ˆ μ 36 . 1 ˆ = ( 29 ( 29 = = - = - = - i i i i i x n a na i x a i x da a i x d ) ( 1 0 ) ( ) ( 2 ) ( 2 ( 29 2 ) ( min ˆ - = i a a i x
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Sample Median and Mode Sample median = value that has an equal number of data points above or below it If, as in our example, n is even, it is usually defined as the halfway between the 2 middle values Ex: (whereas m=0) Location measure
Background image of page 6
Image of page 7
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 23

class_09_03 - Statistical Data Mining ORIE 474 Fall 2007...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online