| Terms |
Definitions |
|
MSw
|
SSw/dfw
|
|
mode
|
Most common value
|
|
ORDINAL
|
Attributes can be ordered
|
|
Curvilinear
|
Correlation is ineffective with curvilinear because it works with linear relationships
-Make it effective by drawing line (range restriction)
|
|
discrete
|
numerical data sets where possible values are isolated points
|
|
population later?
|
india will surpass the us.
|
|
correlation ignores
|
distinction between explanatory and response variables.
|
|
No
|
Can you prove cause/effect with an observational study?
|
|
INTERVAL DATA
|
-QUANTITATIVE
-EXAMLE IS TEMP. DIFF B/C 60 DEG. AND 61 DEG. IS THE SAME AT 80 DEG. AND 81 DEG.
|
|
Arithmetic Scale
|
Distances on the Y-axis are proportional to the magnitude of the variable being displayed on this scale.
|
|
Outlier
|
An individual that falls outside the overall pattern
|
|
beta
|
the second letter of the Greek alphabet (β, B).
|
|
test statistic
|
statistic used to test a hypothesis
|
|
double blind procedure
|
an experimental procedure in which both the research participants and the research staff are ignorant (blind) about whether the research participants have recieved the placebo.
|
|
Expected Values
|
Expected Values = (valueof event1) x (prob of event 1) + (value of event 2)X(prob of event 2)
|
|
Define "mode."
|
The mode is the most frequently occurring score in a distribution
|
|
INFERENTIAL STATISTICS
|
Techniques that allow us to study samples and make generalizations or inferences about the populations from which they were selected
- Making Inferences
- Hypothesis Testings
- Determining Relationships
- Making Predictions
|
|
standard deviation
|
measure of dispersion in a frequency distribution
|
|
Statistically Significant
|
When an observed difference is too large to believe that it is likely to have occurred naturally
|
|
right skewed
|
values are more spread out to the right side
|
|
Frequency distribution
|
a tabular summary of data showing the frequency of items in each of several classes
|
|
adroit
|
expert or nimble in the use of the hands or body.
|
|
Construct validity is..
|
the degree to which the test actually tests what you want it to. the truthfulness of the measures
|
|
Continuous quantitative variables
|
-can take on any value along an interval
-applicable when there are no gaps between the exact values which these variables can take on, such as weight, height, volume, or distance.
|
|
How are you going to get your information from your sample?
|
Observational Study: observe individual and measure variables of interest but does not attempt to influence response. (e.g. stand back and watch)Experiment: imposing some type of treatment on individual in order to observe their response.Anecdotal Evidence: Not good science (e.g. Dateline)
|
|
Theory is a set of related -- about the causes of a -- and the -- that specify how - - --.
|
assumptions phenomenon, rules specific causes act.
|
|
the distance between two mean in standard deviation units.
|
When should you reject the null hypothesis?
|
|
How do you determine F-crit for comparisons?
|
You always use 1 for degrees of freedome in the numerator and the degrees of freedom for your Df s/a
|
|
MGF
|
E(e^(xt))
|
|
symmetric
|
mean = median
|
|
mutually exclusive
|
events that have no outcomes in common
|
|
QUANTITATIVE VARIABLES
|
Numeric
Can be represented numerically
(height, age)
|
|
dependent variable
|
the experimental factor-in psychology that is being measured; the variable that may change in response to manipulations of the independent variable
|
|
Pareto Chart
|
Displays categorical data, with categories displayed in descending order of frequency, so the most common categories appear first.
|
|
dependent variables
|
the outcomes that are measured
|
|
mean
|
this measure of center is not resistant
|
|
MODE
|
OCCURS MORE THAN ONCE AND MORE FREQUENTLY
|
|
Median
|
Middle value in sorted array. Pro: Good when extreme data values exist. Con: Ignores extremes and can be effected by gaps in data values.
|
|
direct assocaition
|
a positive direction or association means that, in general, as one variable increase, so does the other. When increase in one variable generally correspond to decreases in the other, the assocatiation is negative
|
|
first rule of data analysis
|
plot the data.
|
|
false consensus effect
|
the tendency to overestimate the extent to which others share our beliefs and behaviors
|
|
Qualitative data
|
Values that can be placed into nonumerical categories
|
|
Area Principle
|
In a statistical display, each data value should be represented by the same amount of area.
|
|
POPULATIONS
|
The entire set of individuals we are interested in studying
(ex) - SAT scores of incoming UM freshmen
|
|
deviation
|
difference between one of a set of values and some fixed value, usually the mean of the set
|
|
Census
|
A sample that consists of the entire population
|
|
condition for normal distribution
|
1.data values clustered near mean= single peaked2.Values spread evenly around mean making symmetric3.Large deviation from mean becomes incresingly rare= producing tapering tails4.Indiviual data results from comnination of many different factors such as genetic and enviormental factors
|
|
Cross-Sectional data
|
data collected at the same or approx the same point in time.
|
|
probability distribution
|
distribution of all values of a random variable with an indication of their probabilities
|
|
An interaction is...
|
...when the effect on one factor is not the same as the effect on all levels of another factor
|
|
STATISTICS
|
Values that describe a SAMPLE
(ex) the average SAT score for every
tenth freshman from an
alphabetical list of their last
names
|
|
Why are experiments better than observational studies?
|
Give good evidence for causation.Study the combined effects of several factors (interactions between factors can be very important)Control the effects of lurking variables (these get in the way of variables being studied)
|
|
Ethics of Experiments with Humans:
|
Planned studies need to be reviewed by board.All subjects must give their informed consent before data is collected.All individual data must be kept confidential. Only summaries can be made public.(Anonymity: researcher doesn't know subjects)
|
|
The standard deviation of the population.
|
What is another name for the standard deviation of the distribution of sample means?
|
|
Lower tailed test (Also called a left-tailed test):
|
A test with “<” in the alternative hypothesis. This is a one-sided test.
|
|
Quantitative
|
numerical value
|
|
INTERVAL
|
Distance is meaningful
|
|
Variable
|
any characteristic of an individual
|
|
multiplication rule
|
method for finding the probability that both of two events occur
|
|
Cluster Sample
|
Everyone in a group participates
|
|
Treatment
|
A condition applied to the experimental unit. i.e., a new drug is administered to patients.
|
|
variance
|
the square of the standard deviation.
|
|
Distribution
|
way values are spread over all possiable values
|
|
graphs
|
frequency always on y, variable always on x
|
|
Null Hypothesis
|
A hypothesis that the difference between two population means is zero or null.
Symbol for the Null Hypothesis = Ho
|
|
probability
|
likelihood of the occurrence of an event
|
|
Inference
|
Using results from a sample statistic value to draw conclusions about the population parameter.
|
|
control group
|
an experiment is the group of subjects who do not receive the treatment being tested
|
|
Extrapolation
|
The use of a regression line for prediction for outside the range of values of the explanatory variable x that you used to obtain the line. (Such predictions are often not accurate)
|
|
stacked bar graphs
|
compares the contribution of each value to a total across categories.
|
|
conditional probability
|
the probability that an event will occur under the condition that another event occurs first: equal to the probability that both will occur divided by the probability that the first will occur.
|
|
convenience sampling
|
choosing a sample due to ease of sampling
|
|
Steps in Statisitcal study
|
1 identify goal2 choose sample3 collect data4 use sample to make inferences5 draw conclusions
|
|
Bayes' Theorem
|
P(A|B) = ( P(B|A)P(A)) / P(B|A)P(A) + P(B|~A)P(~A)
|
|
range
|
set of all values attained by a given function throughout its domain
|
|
Left Skewed
|
Value are more spread out n the left side
|
|
Level of confidence:
|
The percent of the time that the confidence interval estimation procedure will give you intervals containing the value of the parameter being estimated. (Note: This can only be defined in terms of probability as follows: “The probability that the confidence interval to be computed (before data are gathered) will contain the value of the parameter.” After data are collected, level of confidence is no longer a probability because a calculated confidence interval either contains the value of the parameter or it doesn’t.)
|
|
For which types of scales can the mean be used?
|
Interval and Ratio
|
|
Why is it important to identify an outlier?
|
-might be incorrectly recorded value-might be a data value that was incorrectly included-might be a correctly recored data that belongs in data set
|
|
The standard deviation is the square root of the variance.Another answer: The variance is the standard deviation squared.
|
Name 2 advantages of using the standard deviation instead of the range as a measure of variability.
|
|
Why is the mean a poor measure of central tendency for a skewed distribution?
|
A skewed distribution has outliers in the tail. Outliers can make the mean unrepresentative of the distribution as a whole.
|