week2class - Quick Review Describing Distributions...

Info iconThis preview shows pages 1–9. Sign up to view the full content.

View Full Document Right Arrow Icon
Quick Review
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Describing Distributions Numerically Measures of Center: The mean or arithmetic average To calculate the average, or mean, add all values, Heights of 25 Women then divide by the number of observations. Sum of heights is 1598.3 Divided by 25 women = 63.9 inches Your numerical summary must be meaningful The distribution of the women’s height appears coherent and symmetrical. The mean is a good numerical summary. Height of 25 women in a class
Background image of page 2
Here, reporting a single numerical summary would not make sense in discribing the overall pattern of the data because there are really three seperated distribution. Measures of Center: The median The median is the midpoint of a distribution—the number such that half of the observations are smaller and half are larger. Here the shape of the distribution is wildly irregular. Why? Multimodal
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Comparing the mean and median: The mean and the median are roughly equal only if the distribution is symmetric. The median is a measure of the center that is resistant to skew and outliers. The mean is not resistant to such deviant values. mean= meadian n=25 (n+1)/2=26/2=13 Median =3.4 2. If n is odd, the median is observation ( n +1)/2 down the list 1. Sort observations from smallest to largest. n = number of observations ______________________________ N=24 n/2=12 Median=(3.3+3.4)/2=3.35 3. If n is even, the median is the mean of the two center observations Mean and median for a symmetric distribution
Background image of page 4
mean < median median < mean Mean and Median of a distribution with outliers Mean and median for a distribution that is skewed left or negatively skewed Mean and median for a distribution that is skewed right or positively skewed
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Impact of skewed data Percent of people dying Without the outlier With the outlier The median, on the other hand is only (pulled to the right a little). The mean is pulled to the right a lot by the outliers. (From 3.4 to 4.2)
Background image of page 6
Mean vs. Median Mean: - easy to calulate - easy to work with algebraically - highly affected by outliers - not a resistant measure Median: - can be time consuming to calculate - more resistant to a few extreme observations Measures of Center: The mode The most frequent value in the data Important for categorical data Possible to have more than one mode Mean and median of a symmetric distribution Mean and median of a skewed distribution
Background image of page 7

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Mean, Median, Mode If the distribution is exactly symmetric (normal), the mean, the median and the mode are exactly the same. If the distribution is skewed, the three measures differ. The choice for describing the center of a distribution depends heavily on the shape of the distribution and the type of data (quantitative or qualitative). Skewed Data:
Background image of page 8
Image of page 9
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 34

week2class - Quick Review Describing Distributions...

This preview shows document pages 1 - 9. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online