| Terms |
Definitions |
|
MSwithin reflects...
|
error
|
|
Homogeneous
|
Similar in makeup
|
|
Systematic Sample
|
Every "nth" person participates
|
|
null hypothesis
|
hypothesis to be tested
|
|
Multicollinearity
|
x values are correlatedcan cause problemsdon't assume they are and then throw them outIf the sign is backwards (-) means they may be tied togetherWill have to illiminate one and keep the othersIf you leave it, it inflates the standard error of the estimate
|
|
1.They help us identify outliers.2.They tell us how well the measure of central tendency summarizes the entire distribution.3. They are used to compute other statistics.4.They can be theoretically important in and of themselves
|
Define range
|
|
regression
|
process of finding a regression equation
|
|
Standard Normal Curve/ Distribution
|
symmetrical, bell-shaped, central tendency to middle, most fall b/w -1 & +1
|
|
five number summary
|
Distribution consists of the following 1 lower value2 lower value3 median4 upper Quartil5 high quartil
|
|
Case-Control Study
|
A retrospective study in which subjects who have a response outcome of interest (the cases) and subjects who have the other response outcome (the controls) are compared on the explanatory variable.Ex. The brain cancer-cell phone usage example.
|
|
Response error
|
error due to respondents reporting information that is not truthful, and nonresponse error is error due to differences between those who responded to the survey and those who did not respond.
|
|
multiplication rule
|
method for finding the probability that both of two events occur
|
|
Control Group
|
Experimental units assigned to a baseline treatment level
|
|
how would being wider affect a curve
|
larger variability
|
|
relative frequency
|
proportion or percentage of the data value that falls in categoryRelative Freq= Freq in categoriey ------------------- total freq
|
|
Sample
|
the part of the population from which we actually collect information used to draw conclusions about the whole
|
|
Statistic
|
A characteristic of a sample; a number computed from sample data (without any knowledge of the value of a parameter) used to estimate the value of a parameter. Examples include x-bar , the sample mean, and s, the sample standard deviation.
|
|
population
|
all the cases in a group from which samples maybe drawn for a study
|
|
Research is important if 3. Unimportant if 3.
|
1)Clarify relationship amoung variables 2) Support only one hypothesis 3) answer leads to practical application. 1) answer already firmly established 2)variables are known to have a small effect/ theoritcal interest 3)variables not believed to be causally related
|
|
Correlated Groups t test
|
•Typically used to analyze the relationship between two variables under the following conditions:
oIf the dependent variable is quantitative in nature in nature and is measured on a level that at least approximates interval characteristics
oThe independent variable is within subjects in nature
oThe independent variable has two, and only two, levels
|
|
range
|
set of all values attained by a given function throughout its domain
|
|
Symmetric Data
|
The mean and the median are about the same.
|
|
law of large numbers
|
the theorem in probability theory that the number of successes increases as the number of experiments increases and approximates the probability times the number of experiments for a large number of experiments.
|
|
population mean
|
the mean of data set is the average of all of the values
|
|
The One-Sample t Test
|
The One-Sample t test should be used in place of a z score conversion when the standard error of the mean (σ) is unknown.
If you look at the equation to calculate a t score, it is almost exactly the same as the z score conversion, with the exception of the denominator, in which you use the estimated standard error of the mean
|
|
Why do we need a null hypothesis
|
In behavioral research, you can never actually prove that something is true, but we can prove that it’s false.
Provides a starting point for any statistical test
|
|
General idea of what outliers do to your data
|
•If you have 1 outlier compare it with the data and compare it without during correlation
•Can cause a strong correlation to appear weak and vice versa
|
|
Binomial Variance
|
np(1-p)
|
|
Class width
|
max-min/number of classes
|
|
MSbetween reflects...
|
treatment effects + error
|
|
factor
|
Another term for explanatory variable.
|
|
Ratio scales
|
Magnitude, equal intervals, absolute zero
|
|
Elements
|
entities on which data are collected
|
|
Unexplained variation
|
the sum of squared residuals
|
|
Hawthorne Effect
|
Subjects change their behavior because they know they are in an experiment
|
|
line chart
|
distrobutions of quantitative data as a series of dots onnected by lines
|
|
The 11thperson was probably an outlier, possibly yielding misleading results.
|
Define regression
|
|
census
|
attempt to contact every individual in the entire population
|
|
independent variable
|
variable whose value determines the value or values of other variables
|
|
Experiments
|
Study in which subjects are randomly assigned to treatments
|
|
Trend
|
Long term, more than a year (t)=time variable=is used to find trent
|
|
Symmetric
|
If neither tail of the histogram is longer than the other.
|
|
SS total
|
the total variability in the dependent variable
|
|
Matched pairs
|
Either two measurements are taken on each individual such as pre and post OR two individuals are matched by a third variable (different from the explanatory variable and the response variable) such as identical twins or windows matched by installer when comparing installation time of two brands of windows.
|
|
individual
|
the basic unit (or subject) of the experiment upon which a treatment is applied
|
|
cicero is doing a study to find the average life of indoor cats vs outdoor cats. what type is he performing?
|
t-test
|
|
Coefficient of Variation
|
Used to compare dispersion in data sets with dissimilar sets of measurement. It is the the standard deviation expressed as a percent of the mean.
|
|
Unbiased Estimator
|
A sample mean is an Unbiased Estimator of the population mean
An unbiased estimator of a population parameter is a statistic whose mean across all possible random samples of a given size equals the value of the parameter.
(There are going to be some higher and some lower so it all equals out)
|
|
marginal Distribution
|
The distribution of the values in the "total" row (or the "total" column) of a two-way table.
|
|
Crossover Design
|
A type of matched pair design in which subjects \"cross over\" during the experiment from using one treatment to using another treatment.
|
|
Sample mean, x bar
|
the random variable of the sampling distribution of x bar.
|
|
a systematic sample
|
since we are selecting every k = 13th invoice.
|
|
The probability of rejecting a false null hypothesis; computed as 1 – β. Increase power by increasing sample size.
|
A difference between the observed statistic and the claimed parameter value that is large enough to be worth reporting. To assess practical significance, look at the numerator of the test statistic and ask ‘Is it worth anything?’ If yes, then results are also of practical significance. Note: Do not assess practical significance unless results are statistically significant.
|
|
Sampling Distribution of x bar
|
A list of all the possible values for x bar together with the frequency (or probability) of each value; in other words, the distribution of all x bars from all possible samples.
|
|
rooney is using a ztest to determine if the new turtle feed is worth the extra money. he already gathered raw scores. what is his next step
|
to find the distance from the mean and to find the zscore
|