STAT5101: Foundations of Data Science
Assignment 3
Academic year 15/16, First term
Deadline: During Class, Nov 10 (TUE), 2015.
1. The following data represent the number of days absent per year in a p
1. Suppose the following information is obtained from Robert Keeler
Learning Objectives
In this chapter, you will learn:
To distinguish between di
Chapter Goals
After completing this chapter, you should be able to:
Compute and inte
Chapter Goals
After completing this chapter, you should be able
to:
Int
Find quartiles Q1, Q2, Q3 and mode:
x1, ..., xn: raw data
n: sample size
qi: the rank (position) of Qi
[qi]: the integer part of qi, i = 1, 2, 3, e.g. [3.3] = 3, [4.8] = 4
Find Q2 (the second quartile or me
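The notation above (a rank qi and its integer part [qi]) suggests locating each quartile by rank and interpolating between neighbouring order statistics. The rank formula itself is cut off in the text, so the sketch below assumes qi = i(n + 1)/4 and linear interpolation; both are assumptions, and the function name `quartile` is hypothetical.

```python
import math


def quartile(data, i):
    """Return Q_i (i = 1, 2, 3) by rank with linear interpolation.

    Assumption (not stated in the source): rank q_i = i * (n + 1) / 4.
    """
    xs = sorted(data)
    n = len(xs)
    q = i * (n + 1) / 4      # rank (position) of Q_i, counted from 1
    k = math.floor(q)        # [q_i], the integer part of the rank
    frac = q - k             # fractional part used for interpolation
    if k < 1:                # rank falls before the first observation
        return xs[0]
    if k >= n:               # rank falls at or after the last observation
        return xs[-1]
    # Interpolate between the k-th and (k+1)-th ordered values.
    return xs[k - 1] + frac * (xs[k] - xs[k - 1])


sample = [7, 1, 5, 3, 2, 6, 4]   # made-up data for illustration
q1, q2, q3 = (quartile(sample, i) for i in (1, 2, 3))
```

Other textbooks round the rank instead of interpolating, so results can differ slightly from one convention to another.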
Chapter Goals
After completing this chapter, you should be
able to:
Expla
TABLE E.3
Critical Values of t
For a particular number of degrees of freedom, entry represents the
critical value of t corresponding to a specified upper-tail area (α), t(α, df)
UPPER-TAIL AREAS
TABLE E.4
Critical Values of χ²
For a particular number of degrees of freedom, entry represents the critical value of χ² corresponding to a specified upper-tail area (α), χ²_U(α, df)
TABLE E.7
Table of Poisson
Probabilities
For a given value of λ, entry
indicates the probability of a
specified value of X.
X 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.
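Entries of a Poisson table like this one can be recomputed from the standard Poisson formula P(X = x) = e^(-λ) λ^x / x!. The sketch below is only an illustration; the function name `poisson_pmf` is hypothetical and not part of the table.

```python
import math


def poisson_pmf(x, lam):
    """P(X = x) for a Poisson random variable with mean lam."""
    return math.exp(-lam) * lam ** x / math.factorial(x)


# First rows of the column for lambda = 0.1, rounded to four decimals
# as printed tables usually are.
column = [round(poisson_pmf(x, 0.1), 4) for x in range(3)]
```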
APPENDIX E Tables
TABLE E.1
Table of Random Numbers
   00000 00001 11111 11112 22222 22223 33333 33334
   12345 67890 12345 67890 12345 67890 12345 67890
01 49280 88924 35779 00283 81163 07275 89863 02348
02 61870 41657 07468 0861
Point Estimation
Definition
A point estimate of some population parameter θ is a single numerical value θ̂ of a
statistic Θ̂. The statistic Θ̂ is called the point estimator. Estimation problems occur fr
3-3.2 The Use of P-Values for Hypothesis Testing
Definition
The P-value is the smallest level of significance that would lead to rejection of
the null hypothesis H0.
P-value = 0.0347
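The definition translates into a simple decision rule: H0 is rejected at level α exactly when the P-value is at most α. The sketch below assumes an upper-tail Z test for computing the P-value; the function names and all numbers are hypothetical, chosen only for illustration.

```python
from statistics import NormalDist


def upper_tail_p_value(z):
    """P-value of an upper-tail Z test: P(Z >= z) under H0."""
    return 1.0 - NormalDist().cdf(z)


def reject_h0(p_value, alpha):
    """Reject H0 when the P-value does not exceed the significance level."""
    return p_value <= alpha


p = upper_tail_p_value(1.82)      # hypothetical observed test statistic
decision_05 = reject_h0(p, 0.05)  # decision at the 0.05 level
decision_01 = reject_h0(p, 0.01)  # decision at the 0.01 level
```

Reporting the P-value lets a reader apply any significance level without redoing the test.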
Example 1: Suppose that a friend tells you that he will meet you for lunch
at a restaurant at 12:00pm ± 10 minutes. What is the probability that your
friend will arrive between 12:05pm and 12:08pm?
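One way to work this example is to assume the arrival time is uniformly distributed over the 20-minute window from 11:50am to 12:10pm. The example is cut off before any distribution is named, so the uniform model is an assumption, and the helper `uniform_prob` is hypothetical.

```python
def uniform_prob(a, b, lo, hi):
    """P(lo < X < hi) for X uniformly distributed on (a, b)."""
    lo, hi = max(lo, a), min(hi, b)   # clip the interval to the support
    return max(hi - lo, 0) / (b - a)


# Times are measured in minutes after 12:00pm, so the assumed arrival
# window is (-10, 10) and the event of interest is (5, 8).
p_arrival = uniform_prob(-10, 10, 5, 8)
```

Under this assumption the answer is the length of the interval of interest divided by the length of the window, 3/20.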
Exam
Example 1:
A manufacturer of semiconductor devices takes a random
sample of size n of chips and tests them, classifying each chip as defective or
non-defective. Let Xi = 0 if the chip is non-defective
1. The population mean waiting time to check out of a supermarket has been
1. The time between arrivals of customers at a bank during the noon to 1
Summary of Chapter 9
Concepts
Chi-Square Tests:
Test statistic $\chi^2 = \sum_{\text{all cells}} \frac{(f_o - f_e)^2}{f_e}$,
where $f_e = \frac{n_{i+}\, n_{+j}}{n}$
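The chi-square statistic sums (f_o - f_e)^2 / f_e over all cells of a contingency table, with each expected frequency f_e equal to (row total × column total) / n. A minimal sketch of that computation follows; the function name and the counts are hypothetical.

```python
def chi_square_statistic(observed):
    """Chi-square statistic for a table of observed counts (list of rows)."""
    row_totals = [sum(row) for row in observed]        # n_{i+}
    col_totals = [sum(col) for col in zip(*observed)]  # n_{+j}
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, f_o in enumerate(row):
            f_e = row_totals[i] * col_totals[j] / n    # expected frequency
            chi2 += (f_o - f_e) ** 2 / f_e
    return chi2


table = [[10, 20], [20, 10]]   # made-up 2 x 2 counts
stat = chi_square_statistic(table)
```

The statistic is then compared with the upper-tail critical value of χ² with (rows - 1)(columns - 1) degrees of freedom.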
Concepts
Examples
1. χ² test for the difference between two proportions:
STAT 5101: Foundations of Data Science (2014 - 2015)
Mid-Term Examination
Oct. 21, 7:00pm - 9:00pm
Answer all questions.
1. (10%) Multiple-choice test questions.
(1) The width of each bar in a histogra
1. For each of the following variables, determine whether the va
1. The manager of a paint supply store wants to determine whether the mean amount of paint contained in 1-gallon cans
1. Children in the United States account directly for $36 billion in sale
STAT 5101 Assignment 4
Suggested Solution
1.
a.
, where
Under
b.
the population mean amount of paint
and
. The
test statistic is between the two critical values. Hence, at the 0.05 level of
significance,
Solution for STAT5101 Assignment 2
1.
(a) P(10<X<30) = (30 - 10)/120 = 0.1667
(b)
2.
The normal probability plot confirms that the data appear to be approximately normally distributed.
3.
4.
5.
(a)
(b
Solution for STAT5101 Midterm Exam
1. (10%)
A, B, D
D, D, D
2. (20%)
(c)
(d)
3. (20%)
You may also calculate the results of P(X = 0). It being close to 0 can also
provide us the con
Summary of Chapter 7
Concepts
Hypothesis Test for μ:
1. σ is known (Z test): Test statistic $Z = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}}$
Concepts
Examples
Two-Tail Test: H0: μ = μ0, H1: μ ≠ μ0
1) Use critical value: If Z > Z_{α/2} or Z < −Z_{α/2}
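The two-tail critical-value rule can be sketched as follows, assuming the truncated rule ends with "reject H0" and that μ in the test statistic is set to the hypothesized value μ0. The function name and the sample figures are hypothetical.

```python
import math
from statistics import NormalDist


def two_tail_z_test(xbar, mu0, sigma, n, alpha=0.05):
    """Two-tail Z test for the mean when sigma is known.

    Returns the test statistic, the critical value Z_{alpha/2},
    and whether H0: mu = mu0 is rejected.
    """
    z = (xbar - mu0) / (sigma / math.sqrt(n))
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)   # upper alpha/2 point
    reject = z > z_crit or z < -z_crit
    return z, z_crit, reject


# Hypothetical sample: mean 372.5 from n = 25, sigma = 15, H0: mu = 368.
z, z_crit, reject = two_tail_z_test(372.5, 368, 15, 25)
```

With these made-up numbers Z = 1.5 lies between the two critical values, so H0 is not rejected at the 0.05 level.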