LAB ACTIVITY 1
(Use Class Survey Dataset in Angel.
Lab Quiz 1 is on activities 1 4.)
Activity 1: Types of variables
a. Identify the following variables as quantitative or categorical. (If quantitative,
state whether it is continuous or not. If categorical
Activity 1 Solution
Problem 1 (BY HAND type in your work):
Consider the following three sets of sample observations:
Set 1: 8, 9, 10, 11, 12
Set 2: 8, 9, 10, 11, 100
Set 3: 8, 9, 10, 11, 1000
A) Find the median and IQR for each data set.
Sampling Distributions (4 questions)
1. Using a college cost estimator, the average of campus rental at Penn State is approximately $547 per
month. Suppose these rates are roughly normal with a standard deviation of $100. Say you take a
Continuous Distributions (4 questions)
1. A World Health Organization study of health in various countries reported that in Canada, the systolic
blood pressure readings (i.e. this is the top number of your blood pressure e.g. the 120 if your bp
From lab 1:
A picture of all the
calculated for the
average word length
of the Gettysburg
Actual average word length:
Did we have biased samples? How
How could we get a true random
Statistics 200: Lab Activity 5
Learning objectives from Lecture / Lab 5:
Recognize the uses and meaning of the standard deviation.
Compute and interpret a z-score; use the 95% rule.
Use range, interquartile range, and standard deviation as measures of sca
Statistics 200: Lab Activity 3
Learning objectives from Lecture / Lab 3:
Use information about a categorical variable to find a proportion, with correct notation.
Display information from a categorical variable in a table or graph.
Display information abo
Statistics 200: Lab Activity 4
Learning objectives from Lecture / Lab 4:
Use a dotplot, histogram, or boxplot to describe the shape of a distribution.
Find the mean and median for a set of data values, with appropriate notation.
Identify the approximate l
Statistics 200: Lab Activity 2
Learning objectives from Lecture / Lab 2:
Recognize that not every association implies causation
Distinguish between experiments and observational studies, recognizing that only randomized
experiments can lead to claims of c
Statistics 200: Lab Activity 6
Learning objectives from Lecture / Lab 6:
Describe an association displayed in a scatterplot.
Explain what a positive or negative association means between two variables.
Use technology to find a correlation, and regression
Statistics 200: Lab Activity 8
Learning objectives from Lecture / Lab 8:
Use a description of a study to evaluate the method of data collection and to identify relevant
Use the method of data collection to determine what inferences might be poss
Statistics 200: Lab Activity 7
Learning objectives from Lecture / Lab 7:
Calculate predicted values from a regression line
Calculate residuals and visualize residuals on a scatterplot
Beware of extrapolating too far out when making predications.
Which is most likely true when considering
salaries($) in a company that employs 20
factory workers and 2 very highly paid
One would find with the salaries that the
mean > median
mean < median
I have no
The correlation for this
scatterplot is closest to
E. I dont know
VARIABLES LINEAR REGRESSION
mum g .88th uﬁ B om RE»? mg mm anon
w a “6m 3% 5:3 ow ﬁgs? 59m mﬁﬁ Em
2E .Eoewﬁ an E. ESE “85 555559“
Eu:on 2: 3 ﬁg 2: E E mEE
A? .5 am 5:05 Ea 25m ﬁﬂamwlﬁoe m5
.é xv 5 mm a 5% 58:93 uﬁ E 853?,
0.8: wk 358 ES: mgr: 2: Swap EoEEp
35 FE bﬁﬁ E ~me 88:83 RE F
This corresponds to lectures 15 to 23 on Angel.
Sampling distributions (chapter 9)
population parameter = population mean = u
population standard deviation = o
sample mean= the mean of this sample, also one example of a sample
statistic= _ X.(random varia
ESL015 Section 003
Due Saturday April 9th 11:59 pm
Type in your answers and then upload it to Angel drop box:
Homework, peer review, reading assignments > Self-study homework drop
Name your file as Self-study homework _First Name.
LAB ACTIVITY 16 (Hypothesis tests One proportion)
Quiz will be on activity 1
P-value Guidelines when using Standard Normal Table (i.e. the Z-table):
Keep this in mind: The method for finding the p-value is based on the alternative hypothesis.
LAB ACTIVITY 19
(Confidence Intervals for the Population Mean of Paired Differences AND
Confidence Intervals for the Difference in Two Population Means (Independent Samples)
To understand the context for todays lab, observe the following.
We first conside
LAB ACTIVITY 23 (Review)
The High School and Beyond data is from a large-scale longitudinal study conducted by the National
Opinion Research Center (1980) under contract with the National Center for Education Statistics. Below
is a table repre
LAB ACTIVITY 12 (Continuous Random Variables: Sections 8.5, 8.6)
To find probabilities for a normal random variable in Minitab, use:
Calc probability distributions normal
To find P(X is less than or equal to a), choose cumulative probability, fill in the
LAB ACTIVITY 20 (More on proportions)
A Gallup poll asked the question, In general, do you feel that the laws covering the sale of firearms should be made: more strict, less strict, or kept as they are now? Of the n = 493 men who
Gathering Data (7 problems)
Study 1: A British study compared 966 patients with brain cancer to 1716 patients without brain cancer.
The patients cell phone use was measured using a questionnaire. Cell phone use for the two groups was
Probability (5 questions)
Problem 1: Your professor gives a surprise quiz consisting of 10 true/false
A) What is the complement of the event of getting at least one question wrong?
Getting all of the questions right
B) With random guessing for
Data Set used for this activity: Class Survey
1 Suppose a medical researcher compares the average blood pressures of women
who take oral contraceptives to the blood pressures of women who do not.
a. Is blood pressure a categorical variable
Confidence Intervals Proportion and One Mean
1 The term sampling frame refers to the group that actually had a chance to get into the sample. Ideally,
this is the same as the population of interest, but sometimes it isnt. In the following situation, descr
No Data Set Needed
1 Suppose that medical researchers want to estimate the true proportion of all teenagers with
high blood pressure whose blood pressure would decrease if they took calcium supplements.
To test this, they plan a cli
No data sets used for this activity
Non-response bias refers to a bias that occurs if people with a particular opinion or trait tend to refuse
to answer certain survey questions (or possibly refuse to participate in the survey at all).
Analysis of Variance (ANOVA)
One-way analysis of variance is a method for comparing several population means, when the
data are from independent samples. It can be thought of as a tool for examining the relationship
between a quantitative response variabl