Chapter 8
Definition (review)
A parameter is a numerical measurement describing some characteristics of a population.
Any quantity computed from values in a sample is called a statistic.
The observed value of a st
Home work
STA 1053
Sudip Roy
Total points 30 (Each question 5 points)
(Due date 27th June Monday)
Write your answer on separate page and submit it stapled with the question paper.
Q.1
The Nielsen Company conducts surveys each year about the use of various
Chapter 5
Probability
1
Introduction to Probability
A probability is a numerical value that indicates the chance,
or the likelihood, of a specific event occurring.
Weather forecaster commonly use probability when
referring to the chance of rain the foll
Chapter 2 (part 2)
Graphical Methods for
Describing Data
Distributions
Displaying Categorical
Data
Bar Charts
Bar Chart
When to Use: Univariate, Categorical data
To comply with new standards from the U. S. Department of Transportation,
helmets should reac
Chapter 3
Numerical Methods
for Describing Data
Distribution
Suppose that you have just received your
score on an exam in one of your classes.
What would you want to know about the
distribution of scores for this exam?
Measures of center describe were the
Chapter 4
(Section 4.1)
Describing Bivariate
Numerical Data
Correlation
Pearsons Sample Correlation
Coefficient
Properties of r
Does it look like there is a relationship between the two variables?
If so, is the relationshipYes
linear?
Yes
Does it look lik
BASIC STATISTICS
STA 1053
Sudip Roy (ROY)
Chapter 1
Collecting Data in
Reasonable Ways
Data and conclusions based on data are everywhere
Newspapers
Magazines
On-line reports
Professional publications
Population the entire collection of
individuals or
Chapter 2 (Part 1)
Graphical Methods for
Describing Data
Distributions
Variable
any characteristic whose value may change from one
individual to another
n
o
ti
ia
l
f
a
l
a
c
Numbe
ti
i
l
o
r of tex
P
tbooks
purchas
ed
ge
e
l
l
o
c
o
t
e
m
o
h
m
o
r
f
Di
Lecture 9
Analysis of Variance (ANOVA)
The statistical methodology for comparing several means is called
analysis of variance, or ANOVA. We will consider two ANOVA
techniques:
One-way ANOVA:
- is used when there is only one way to classify the populations
Chi-Square Test for Goodness of Fit
(after Applied Statistics by Hinkle/Wiersma/Jurs)
Scientists will often use the Chi-square ( 2 ) test to determine the goodness of fit between theoretical
and experimental data. In this test, we compare observed values
Name:
10/10/13
STA 1053.1- Handout #11
Hypothesis Testing using Proportion (p)
Multiple Choice:
1. You are interested in determining whether there is strong evidence in support of the
claim that less than 40% of retired adults have a part-time job. To ans
Analysis of Variance (ANOVA) Single Factor
In the past, when we needed to compare population means, we were restricted to determining the
existence of a significant difference using only two data sets. Lets suppose that you, the
statistician, needs to con
SI Information
SI Leader: Laura
Email: si.basicstats@gmail.com
Twitter: si_statistics
Office Hour: 11-11:50am 4th Floor of JPL (use the elevators)
Session times:
Monday: 3-3:50pm *Meet outside classroom*
Thursday: 12-12:50pm JPL (2nd Floor) Study Room #6
Chi-SquareChapter 11, Sections 11.1-11.2
Your text discusses three types of chi-square tests:
1. Goodness-of-Fit
2. Independence
3. Homogeneity of Proportions
We use the Greek symbol 2 (pronounced k i square to rhyme with sky-square) to
represent values o
THE CHI-SQUARED GOODNESS-OF-FIT TEST
Example. Ten years ago the distribution of fish at Simple Lake was as follows:
Type of Fish:
Proportion:
Bass
.10
Catfish
.25
Goldfish
.30
Trout
.35
Recently, a random sample of fish from Simple Lake provided the follo
Chapter 11
Hypothesis Testing Using the
One-Way Analysis of Variance
ANOVA
The analysis of variance (abbreviated ANOVA) is a
powerful hypothesis testing procedure that extends
the capability of t-tests beyond just two samples.
There are many types of ANO
EXTRA CORRELATION PROBLEMS
It is hypothesized that there are fluctuations in norepinephrine (NE) levels which accompany
fluctuations in affect with bipolar affective disorder (manic-depressive illness). Thus, during
depressive states, NE levels drop; duri
Correlation
When two sets of data are strongly linked together we say they have a High Correlation.
The word Correlation is made of Co- (meaning "together"), and Relation
Correlation is Positive when the values increase together, and
Correlation is Negati
ANOVA Simple Linear Regression
Example 18-1
x2 41222.14 592.62 2 6104.66
592.6
2 x
2
lxx x
lxx x
41222.14 10 6104.66
nn
10
2
y2 220360.47 1428.702 2 16242.10
1428.70
2 y
2
l yy yy
l yy
n 220360.47 10 16242.10
10
n
592.6 1428.70
x y
lxy xy x y 918
Coefficient of Determination Formula
Coefficient of Determination is one of the most important tools in statistics which is widely used
in data analysis in economics, physics, chemistry and many more fields. Coefficient of
determination allows us to forec
Chi-squared test for nominal (categorical) data
The c2 test is used to determine whether an association (or relationship) between 2 categorical
variables in a sample is likely to reflect a real association between these 2 variables in the
population.
Note
Quiz 2 :STAT 1053- 001(Fall 2012):
1. The pie chart below summarizes the results of a survey of 300 randomly selected students
at a particular high school. The investigators asked about soft drink preferences at a local
high school.
SoftDrinkPreference
Co