A variable is quantitative if the values it takes are numeric values.
The values of a quantitative variable represent different magnitudes
of that variable.
What are examples of quantitative variables?
Finding Z values
Many inferential methods use z-values corresponding to certain
probabilities from a normal distribution. This entails the reverse use
of the z-table.
Starting with a lower tail probability (the probability of being less
than), which is li
Analyzing association between
Recall, there is an association between two variables if the
distribution of the response variable changes in some way as the
value of the explanatory variable changes.
We are now going to learn methods
for a mean
For quantitative variables, significance tests usually refer to the
The null hypothesis has the form Ho: = o, where o is a particular
value for the population mean.
The hypothesized value in Ho is a particular
We will define nij as the number of observations in row i and in
column j. Thus n21 = 150.
We define ni+ as the numb
Example: The subjects in the previous study were asked to
rate their hunger at the end of each two week period.
Find the P-value for a paired sample t
Example: The subj
test of independence
If the null is true, the expected cell counts ij should be close to
the observed cell counts nij.
The larger the errors, the greater the evidence against the
The Pearson Chi-square test of independe
As with means, hypothesis tests between two proportions generally
are testing if the proportions are the same. That is we generally
Ho: p1 = p2
This is again equivalent to testing
Ho: p1 - p2 = 0
To calculate the p-value, we
We discussed before how to compare the means of two groups, 1
We will now discuss comparing the means of several groups.
The mean of a quantitative response variable is compared among
groups that are categories (or levels) of an explanatory v
No matter what distribution the error terms i have (and hence the
Yi), the least squares method provides unbiased point estimates of
0 and 1.
These estimates have minimum variance among all unbiased
However, to do con
The F statistic is the ratio of the estimated variance between groups
over the estimated variance within groups.
If the estimated variance between groups is large compared to the
estimated variance within groups, the F statistic will be large.
Relations between variables
A functional relationship between two variables is expressed by a
For an independent variable X and a dependent variable Y, a
functional relationship has the form
For example, if we sell widgets fo
One assumption of the ANOVA models is that the standard
deviation is the same for each group. This common standard
deviation is denoted by . Thus we assume that
1 = 2 = . = I =
homoskedasticity: The variances for all the groups are equal
Example: Two-week weight gains (lbs) of lambs on three different
diets 1: State H and H
SS(within ) 210
df(within ) 9
MS(within ) 23.333
Step 6: Calculate the between sums
Like a confidence interval for a proportion, the confidence interval for a
mean has the form: point estimate margin of error.
The margin of error for is z1-/2* y(bar) , where
Thus, in the long run
y 1.96 n
To compare two populations, we estimate the difference between their
To compare population means 1 and 2, we treat 1 - 2 as the
The point estimate of this parameter is the difference in the sample
for a proportion
When the sample size is large, the sampling distribution of p(hat) is
approximately normal with mean p, the true parameter.
We are interested in testing a null hypothesis of the form Ho: p=p o
The alternative can be eith
Definitions relating samples to populations
of a population
(e.g. true mean,
of a sample
(e.g. sample mean,
A statistic is an estimate, based on a sample, of an un
APD did a study of the relationship between the number of
accidents at intersections and panhandlers.
They found that there were a high number of accidents at
intersections that had a high number of panhandlers.
They concluded that there needed to
Good graphics boxplots
Boxplot: visual representation of the five number
This distribution is
skewed to the:
Where would the mean be?
Good graphics boxplots
Boxplot: visual representation of t
Probability: With a random sample or a randomized experiment, the
probability an observation has a particular outcome is the proportion of
times that outcome would occur in a very long sequence of observations .
Probability helps us describe the outcome o
For Categorical data, the following conditions are often true:
Each observation falls into one of two categories (success or
failure, heads or tails)
The probabilities for the two categories are the same for each
variability of data
A measure of center alone is not adequate for numerically describing
data for a quantitative variable.
It describes a typical value, but not the spread of the data about that
Consider two nations, A and B. Bot
We want to estimate parameters from our data. A point estimate is
a single number that is used to estimate the parameter.
An interval estimate is an interval of numbers around the point
estimate, within which the parameter value is believ
The purpose of descriptive statistics is to summarize data in order to
make it easier to assimilate the information.
Tables and graphs are used to show the number of times various
For categorical data, the categories
Confidence intervals: Locating an invisible man!
The Normal Distribution
The empirical rule applies to var
Note the mean of y(bar) is the same as the mean
of the true population. That is the mean of y(bar)
is the same as the mean of an observation.
Note the standard deviation of y(bar) is
smaller than the standard deviation of the
We would not
In statistics, a hypothesis is a statement about a population. It is
usually a statement that a parameter takes a particular numerical
value or falls in a certain range of values.
A significance test uses data to summarize the evidence
for a mean
When the alternative hypothesis is one sided, technically the null
hypothesis is as well.
The null and alternative hypotheses are compliments and comprise
all possibilities, such as it is raining vs it is not raining or the me