Statistics and samples
What is statistics?
Biologists study the properties of living things. Measuring these properties is a challenge, though, because no two individuals from the same biological
[Figure: deaths by cause (infectious disease, wounds, other causes), by month; months shown include May and June.]
Displaying data
The human eye
Appendix 3. Statistical tables
This appendix gives numerical values for a few of the most commonly used probability
distributions. More can be found in references such as Rohlf and Sokal, Biostatistics
Goals of experiments
• Eliminate bias
• Reduce sampling error (increase precision and power)
Controls
• A group which is identical to the experimental treatment in all respects
The normal distribution is very
common in nature
Normal distribution
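\[ f(x) = \frac{1}{\sqrt{2\pi\sigma^{2}}} \, e^{-\frac{(x-\mu)^{2}}{2\sigma^{2}}} \]
[Figure: normal density curve; panel title: Human body temperature; x-axis: Measurement; y-axis ticks from 0.1 to 0.4.]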
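As a minimal sketch, the normal density f(x) can be evaluated directly from its formula. The function name `normal_pdf` and its default parameter values are illustrative assumptions, not part of the original slides.

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    coeff = 1.0 / math.sqrt(2.0 * math.pi * sigma ** 2)
    exponent = -((x - mu) ** 2) / (2.0 * sigma ** 2)
    return coeff * math.exp(exponent)

# With mu = 0 and sigma = 1 the curve peaks at x = 0, at a height just under 0.4.
peak = normal_pdf(0.0)
print(round(peak, 4))
```

The curve is symmetric about mu, so `normal_pdf(1.0)` and `normal_pdf(-1.0)` return the same value.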
A normal distribution is fully described
Measures of location
Mean
Median
Mode
Mean
\[ \bar{Y} = \frac{\sum_{i=1}^{n} Y_i}{n} \]
n is the size of the sample.
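The three measures of location can be computed in a few lines. This is a sketch only: the data values are hypothetical, invented for illustration, and the variable names are assumptions.

```python
import statistics

# Hypothetical measurements, invented for illustration only.
y = [2.0, 3.0, 3.0, 5.0, 7.0]

n = len(y)                      # n is the size of the sample
mean = sum(y) / n               # sum of the Y_i divided by n
median = statistics.median(y)   # middle value of the sorted data
mode = statistics.mode(y)       # most frequent value

print(mean, median, mode)
```

Here the mean is pulled above the median by the larger values on the right, which is typical of right-skewed data.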
Mean
Median
Sample of size 10 from a normal distribution with μ = 13 and σ² = 16
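A sample like this one can be simulated, assuming the caption refers to a population mean of 13 and a variance of 16 (standard deviation 4). The seed is arbitrary and the variable names are assumptions; a different seed gives a different sample mean and variance, which is exactly the sampling error at issue.

```python
import random
import statistics

random.seed(1)  # arbitrary seed so the run is repeatable

mu, sigma = 13.0, 4.0   # sigma squared is 16
sample = [random.gauss(mu, sigma) for _ in range(10)]

sample_mean = statistics.mean(sample)
sample_var = statistics.variance(sample)   # divides by n - 1

print(round(sample_mean, 2), round(sample_var, 2))
```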
Estimating with uncertainty
Chapter 4
[Figure: frequency histograms of two samples, X and Y; y-axis: Frequency. For X, X̄ = 13.5 and s² = 12.1.]
Some tests are designed to examine the relationship between two or more variables.
Others are for a single variable.
1) Am I looking at one variable, or the relationship between two or more variables?
2)
Comparing means
Paired vs. 2-sample comparisons
• Tests with one categorical and one numerical variable
• Goal: to compare the mean of a numerical variable for different groups.
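To make the paired vs. 2-sample distinction concrete, the sketch below computes both t statistics on the same numbers. The data are hypothetical, and the 2-sample version uses the pooled-variance form of the statistic, which is an assumption here since the slides do not show the formula.

```python
import math
import statistics

# Hypothetical before/after measurements on the same five individuals.
before = [10.0, 12.0, 9.0, 11.0, 13.0]
after = [12.0, 13.0, 11.0, 12.0, 16.0]
n = len(before)

# Paired comparison: analyse the within-individual differences.
diffs = [a - b for a, b in zip(after, before)]
t_paired = statistics.mean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))

# 2-sample comparison: treat the two groups as independent, pooled variance.
pooled_var = ((n - 1) * statistics.variance(before)
              + (n - 1) * statistics.variance(after)) / (2 * n - 2)
t_two_sample = (statistics.mean(after) - statistics.mean(before)) / math.sqrt(
    pooled_var * (1 / n + 1 / n))

print(round(t_paired, 2), round(t_two_sample, 2))
```

With these numbers the paired statistic is much larger, because pairing removes the variation among individuals from the comparison.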
Paired comparisons all
Regression
Correlation vs. regression
• Predicts Y from X
• Linear regression assumes that the relationship between X and Y can be described by a line
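A minimal sketch of fitting such a line by least squares and using it to predict Y from X follows. The data points and the helper name `predict` are hypothetical, introduced only for illustration.

```python
# Hypothetical (x, y) pairs, invented for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.0]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Least-squares slope: sum of cross-products over the sum of squares of x.
slope = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
         / sum((xi - x_bar) ** 2 for xi in x))
intercept = y_bar - slope * x_bar

def predict(x_new):
    """Predicted Y on the fitted line for a given X."""
    return intercept + slope * x_new

print(round(slope, 2), round(intercept, 2), round(predict(6.0), 2))
```

The fitted line always passes through the point (x_bar, y_bar), which is why the intercept is computed from the two means.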
Regression assumes:
• Random sample
• Y is normally distributed
Publication bias
Papers are more likely to be published if P < 0.05.
This causes a bias in the science that is reported.
Researcher and statistician error
~8% of biomedical papers have substantial statistical flaws.
Analysis of variance (ANOVA)
Comparing the means of more
than two groups
Null hypothesis for simple ANOVA
H0: μ1 = μ2 = μ3
OR
H0: Variance among groups = 0
HA: Not all μ's are equal (at least one μ differs from the others)
[Figure: groups labelled X1, X2, X3.]
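The link between the two forms of the null hypothesis can be seen in the F ratio, which compares variation among group means with variation within groups. The sketch below computes it by hand for three groups; the data are hypothetical and the variable names are assumptions.

```python
import statistics

# Hypothetical measurements for three groups, invented for illustration.
groups = [
    [4.0, 5.0, 6.0],
    [6.0, 7.0, 8.0],
    [9.0, 10.0, 11.0],
]

all_values = [v for g in groups for v in g]
grand_mean = statistics.mean(all_values)
k = len(groups)            # number of groups
n_total = len(all_values)  # total number of observations

# Variation among group means, and variation within groups.
ss_groups = sum(len(g) * (statistics.mean(g) - grand_mean) ** 2 for g in groups)
ss_error = sum((v - statistics.mean(g)) ** 2 for g in groups for v in g)

ms_groups = ss_groups / (k - 1)
ms_error = ss_error / (n_total - k)
f_ratio = ms_groups / ms_error

print(round(f_ratio, 2))
```

If the group means were all equal, the group mean square would tend to be about the same size as the error mean square, so F would be close to 1; a large F is evidence against H0.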
Inference about means
μ = 67.4
σ = 3.9
Because Y is normally distributed, we can convert its distribution to a standard normal distribution.
μ_Ȳ = μ = 67.4
σ_Ȳ =
Ȳ is normally distributed whenever:
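The conversion to a standard normal distribution is Z = (Y − μ) / σ. The sketch below applies it, assuming the values 67.4 and 3.9 in the example are the population mean and standard deviation; the function name `z_score` and the test measurement are hypothetical.

```python
mu = 67.4     # assumed to be the population mean from the example
sigma = 3.9   # assumed to be the population standard deviation from the example

def z_score(y):
    """Convert a value of Y to the standard normal scale."""
    return (y - mu) / sigma

# 71.3 is a hypothetical measurement, one standard deviation above the mean.
print(round(z_score(71.3), 2))
```

A value equal to the mean maps to Z = 0, and each unit of Z corresponds to one standard deviation of Y.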
Assumptions of t-tests
• Random sample(s)
• Populations are normally distributed
• (for 2-sample t) Populations have equal variances
Detecting deviations from normality: by histogram
[Figure: histogram; x-axis: Biomass; y-axis: Frequency.]
Proportions
Example: 2092 adult passengers on the Titanic; 654 survived.
Proportion of survivors = 654/2092 ≈ 0.31
A proportion is the fraction of individuals
having a particular attribute.
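The Titanic example can be checked in a couple of lines. The counts come from the example above; the variable names are assumptions.

```python
survived = 654       # adult passengers who survived
passengers = 2092    # all adult passengers

# A proportion is the count with the attribute divided by the total count.
p_hat = survived / passengers

print(round(p_hat, 3))
```

A proportion always lies between 0 and 1, so a result outside that range signals a counting error.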
Probability
The history of statistics has its roots in biology
Sir Francis Galton
Pioneer of fingerprint classification, study of heredity of quantitative traits
Regression & correlation
Karl Pearson
Polymath
Studied genetics
Hypothesis testing
Hypothesis testing asks how unusual it is to get data that differ from the null hypothesis.
Hypothesis testing in a nutshell
We want to know something
about this population, say, are
I
Discrete distribution
Fitting probability models to
frequency data
A probability distribution describing a
discrete numerical random variable
For example,
• Number of heads from 10 flips of a coin
• N