Complete List of Terms and Definitions for AP Statistics Terms 2

Terms Definitions
1 unimodal having one mode
2 Se Standard deviation of residuals
Resistant Not affected by extreme values.
blinding any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups
trial the sequence of several components representing events that we are pretending will take place
Range The difference between the smallest and largest numbers in a data set
Correlation Correlation measures the strength and direction of the linear relationship between two quantitative variable. Correlation is usually written as r.
Suppose we have data on variables x and y for n individuals. The values for the first individ. are X1 and Y1, second individ. X2, Y2, and so on. The means and SD of the two variables are x-bar and Sx, for the x values and y-bar Sy for the y-values. The correlation r between x and y is:
r = [1/(n-1)][∑((x - xbar) / Sx)((y - ybar) / Sy)
r only measures straight-line relationships.
3 Placebo A treatment that has no effect
simulation models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
distribution gives the possible values of the variable and the frequency or relative frequency of each value
extrapolation although linear models provide an easy way to predict values of y for a given value of x, it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted
slope the change in the predicted response variable for every unit increase in the explanatory variable
complement rule the probability that something will not happen is 1 - probability that it will happen
shape one of three ways to describe distributions; terms like 'normal, symmetric, skewed, binomial' describe this
Modified boxplot Outliers are plotted as isolated points; whiskers reach to the second-most outlying value.
Probability The proportion of times the event occurs in many repeated trials of a random phenomenon.
3 Undercoverage When part of a population is represented less tan another
Individuals The objects described by a set of data.
statistically significant when an observed difference is too large for us to believe that is is likely to have occurred naturally
census a sample that consists of the entire population
frequency table lists the categories for a categorical variable and displays the counts for each category
response variable values of this variable record the results of each trial with respect to what we were interested in
roundoff error the error that occurs when the percents don't add up to 100%
categorical variable places an individual into one of several groups or categories
Boxplot Center in a boxplot ends at the quartiles and spans the middle portion of the observations. Middle vertical line markes the M, whiskers extend to largest and smallest observations.
4 Confidence interval An interval used for estimating a parameter
Pth Percentile The value such that p percent of observations fall at or below it.
nonresponse bias bias introduced to a sample when a large fraction of those sampled fails to respond
clusters these form when there is a gap between the data
time pot type of plot used to measure a variable over time; time goes on x-axis
Standard normal distribution The SND is the normal distribution N(0,1) with mean 0 and SD 1. If a variable x has any normal distribution N(μ,σ) with mean μ and SD σ, then the standardized variable: z = (x - μ) / σ has the standard normal distribution.
3 Systematic Sample A sample drawn by taking every nth person
addition rule If A and B are disjoint events, the probability of A or B is P(A) + P(B)
simple random sampling design in which each set of n elements in the population has an equal chance of selection
Skewed to the left Left side of the histogram extends much farther out than the right side.
4 Law of Large Numbers The long run probability will approach its theoretical probability
degrees of freedom the number n-1 is called this of the variance or standard deviation
3 Simple Random Sample (SRS) sample size where each element has an equal chance of being selected
Density curve mean and median The median of a density curve is the equal-areas point, the point that divides the area under the curve in half. The mean of a density curve is the balance point, at which the curve would balance if made solid material. (For a symmetric density curve mean = median and are both in the middle.