WELCOME to Statistics 416
STAT 416
Emphasis on application and practical use of statistical methods for designing and analyzing microarray experiments. Students completing STAT 416 should be able to
Statistics 416
HW2
Solutions
1. (20 points)
Create and display a vector of the consecutive integers 1 through 12.
Create a vector of the numbers 9, 5, and 3.
Create the vector 1,2,3,4,5,6,9,5,3 and ass
Stat 416
1.
Homework 3
Solutions
(a) (2 points) Blocking is not used in this experiment. Blocking was defined in our notes as grouping similar experimental units together and assigning different treatme
Stat 416
Homework 4
Solutions
Exam Corrections (10 points)
1. (6 points) For the complete experiment, we have the following table of factors with levels.
Factor
Levels
Diet
chimp
McDonalds
Tissue
brai
STAT416
HW5
Solutions
1. a) (7 points) An interaction between the factors diet and batch would mean
that the differences among the diets would be different for the two batches.
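The idea of an interaction can be illustrated with a small numerical sketch. The diet and batch labels and all of the cell means below are hypothetical values chosen only for illustration; they are not taken from the homework data.

```python
# Hypothetical mean log-expression values for two diets in two batches.
# All labels and numbers here are made up for illustration only.
cell_means = {
    ("diet_A", "batch_1"): 8.0,
    ("diet_B", "batch_1"): 9.0,
    ("diet_A", "batch_2"): 8.5,
    ("diet_B", "batch_2"): 11.0,
}


def diet_difference(batch):
    """Difference between the two diet means within one batch."""
    return cell_means[("diet_B", batch)] - cell_means[("diet_A", batch)]


diff_batch_1 = diet_difference("batch_1")
diff_batch_2 = diet_difference("batch_2")

# With no interaction, the diet difference would be the same in both batches,
# so this contrast would be zero.
interaction_contrast = diff_batch_2 - diff_batch_1

print("Diet difference in batch 1:", diff_batch_1)
print("Diet difference in batch 2:", diff_batch_2)
print("Interaction contrast:", interaction_contrast)
```

In this made-up table the diet difference is larger in the second batch than in the first, which is what a diet-by-batch interaction would look like at the level of cell means.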
Exam 1 from a Past Semester
1. Provide a brief answer to each of the following questions.
a) What do perfect match and mismatch mean in the context of Affymetrix GeneChip
technology? Be as specific as
Solutions to Exam 1 from a Past Semester
1. Provide a brief answer to each of the following questions.
a) What do perfect match and mismatch mean in the context of Affymetrix GeneChip
technology? Be a
Statistical Design and Analysis of Microarray Experiments.
STAT 416

Spring 2014
Statistical Design and Analysis
of Gene Expression Experiments
First Lecture!
An Overview
Central Dogma: DNA → RNA → Protein
Illustration provided by the
National Human Genome
Research Institute
DNA
(tran
Pooling samples
Pooling of tissues or RNA samples is sometimes
necessary to obtain sufficient RNA for
hybridization for microarray experiments.
Even when pooling is not necessary to obtain
enough sa
Bioinformatics processing of NGS data
Peng Liu
RNA-seq data analysis procedure
Image (PH, Current) analysis to get the
sequences of each cluster (read)
Bioinformatics analysis pipeline
Background
Replication in RNA-seq experiment
Two types of replications:
Biological replication
Technical replication
The following definitions are from Nettleton,
Chapter 5 of the NGS book, with slight
modific
Introduction to Experimental Design
Terminology
Experiment: An investigation in which the
investigator applies (assigns) some treatments
to experimental units and then observes the
effect of the tre
Designs that have been discussed
Full factorial treatment design
Experimental designs:
Completely Randomized Design
Randomized Complete Block Design
Latin Square Design
Incomplete Block Design (i
Stat 416
Homework 1
Solutions
1. (15 points) Many of you had difficulties with this problem. The notation calls for one circle for each
experimental unit. The most common mistake was to use one circle f
Solutions
Statistics 416
Exam 1
March 5, 2009
1. It is possible to use more than two dyes and, therefore, more than two samples on a single
slide. Suppose that a four-dye system has been developed so
Name: _
Statistics 416
Exam 1
March 5, 2009
1. It is possible to use more than two dyes and, therefore, more than two samples on a single
slide. Suppose that a four-dye system has been developed so th
Central Dogma: DNA → RNA → Protein
Statistical Design and Analysis of Microarray Experiments
First Lecture! 1/15/2008
Illustration provided by the National Human Genome Research Institute
Microarray T
Microarray Technology Statistical Design and Analysis of Microarray Experiments
1/17/2008 Peng Liu
How to measure expressions of thousands of genes simultaneously? Two types of fabrication for microar
A microarray scanner creates a digital image of a microarray. A digital image is a rectangular array of intensity values.
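As a minimal sketch of what "a rectangular array of intensity values" means, the example below stores a tiny image as a list of rows. The 4 x 5 size and every intensity value are hypothetical; a real scan has far more pixels.

```python
# A tiny hypothetical "scan": each entry is the intensity of one pixel.
# The bright block in the middle stands in for a single spot.
image = [
    [12, 15, 14, 13, 12],
    [13, 210, 250, 220, 14],
    [12, 230, 255, 240, 13],
    [14, 16, 15, 13, 12],
]

n_rows = len(image)
n_cols = len(image[0])

# Rectangular means that every row has the same number of columns.
is_rectangular = all(len(row) == n_cols for row in image)

# Intensity of the pixel in row 3, column 3 (1-based), stored at [2][2].
center_intensity = image[2][2]

print("Rows:", n_rows, "Columns:", n_cols, "Rectangular:", is_rectangular)
print("Intensity at row 3, column 3:", center_intensity)
```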
The Basics of Microarray Image Processing
1/22/2008
Each intensity value corre
Prenormalization analysis Prenormalization Methods for Two-Color Microarray Data
1/22/2008
Image processing
Background correction
Filtration
Transformation
From this image
We get a data table th
Normalization Normalization Methods for Two-Color Microarray Data
1/24/2008 Peng Liu Normalization does not necessarily have anything to do with the normal distribution that plays a prominent role in
Within-slide normalization LOWESS Normalization for Two-Color Microarray Data
1/29/2008 Peng Liu
This is done separately for each slide. The purpose is to make red intensities and green intensities co
In last lecture
Normalization Methods for Two-Color Microarray Data (continued)
1/31/2008 Peng Liu
Within-slide normalization
Intensity-dependent dye effect LOWESS normalization
LOWESS can be applied
A Probe Set for a Particular Gene in GeneChip
gene sequence .TGCAATGGGTCAGAAGGACTCCTATGTGCCT. perfect match sequence AATGGGTCAGAAGGACTCCTATGTG mismatch sequence AATGGGTCAGAACGACTCCTATGTG probe pair pr
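The relationship between the perfect match and mismatch sequences shown above can be checked with a short sketch. The three strings are copied from the slide (with the leading and trailing dots of the gene sequence dropped); the variable names are chosen here for illustration.

```python
# Sequences copied from the slide; the gene sequence is only a fragment.
gene_fragment = "TGCAATGGGTCAGAAGGACTCCTATGTGCCT"
perfect_match = "AATGGGTCAGAAGGACTCCTATGTG"
mismatch = "AATGGGTCAGAACGACTCCTATGTG"

# Both probe sequences have the same length.
probe_length = len(perfect_match)

# The perfect match sequence appears inside the gene fragment shown.
pm_found_in_gene = perfect_match in gene_fragment

# List every position (1-based) where the two probe sequences differ.
differences = [
    (position, pm_base, mm_base)
    for position, (pm_base, mm_base) in enumerate(
        zip(perfect_match, mismatch), start=1
    )
    if pm_base != mm_base
]

print("Probe length:", probe_length)
print("Perfect match found in gene fragment:", pm_found_in_gene)
print("Differences (position, PM base, MM base):", differences)
```

For these sequences the two probes differ only at the middle (13th) base of the 25, where G in the perfect match is replaced by C in the mismatch.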
Terminology
Experiment: An investigation in which the investigator applies (assigns) some treatments to experimental units and then observes the effect of the treatments on the experimental units by me
Suppose we have 24 experimental units and would like to compare the effects of 4 treatments on gene expression. Use a completely randomized design to assign 6 experimental units to each treatment.
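One way to carry out such a randomization is sketched below: shuffle the unit labels and give the first 6 to treatment 1, the next 6 to treatment 2, and so on. The function name, the unit labels 1 to 24, and the seed value are assumptions made for this example, not part of the original notes.

```python
import random


def completely_randomized_design(n_units=24, n_treatments=4, seed=416):
    """Randomly split unit labels 1..n_units into equal-sized treatment groups.

    The seed is fixed only so that the example is reproducible.
    """
    if n_units % n_treatments != 0:
        raise ValueError("n_units must be a multiple of n_treatments")

    units = list(range(1, n_units + 1))
    rng = random.Random(seed)
    rng.shuffle(units)  # every ordering of the units is equally likely

    per_treatment = n_units // n_treatments
    return {
        treatment + 1: sorted(
            units[treatment * per_treatment:(treatment + 1) * per_treatment]
        )
        for treatment in range(n_treatments)
    }


assignment = completely_randomized_design()
for treatment, assigned_units in assignment.items():
    print("Treatment", treatment, "-> units", assigned_units)
```

Because the whole list of units is shuffled at once, every unit is equally likely to receive each treatment, and each treatment ends up with exactly 6 units.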
Statistical Models
Introduction to Mixed Linear Models
A statistical model describes a formal mathematical data generation mechanism from which an observed set of data is assumed to have arisen.
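To make "data generation mechanism" concrete, the sketch below simulates data from a simple one-way model in which each observation equals an overall mean, plus a treatment effect, plus a normally distributed random error. This particular model, the parameter values, and the function name are assumptions chosen for illustration; they are not the mixed linear model developed in the lecture.

```python
import random


def simulate_one_way_model(mu, treatment_effects, sigma, n_per_treatment, seed=1):
    """Generate y = mu + tau_i + e for each treatment i, with e ~ N(0, sigma^2).

    All parameter values passed in are hypothetical.
    """
    rng = random.Random(seed)
    data = {}
    for i, tau in enumerate(treatment_effects, start=1):
        data[i] = [mu + tau + rng.gauss(0.0, sigma) for _ in range(n_per_treatment)]
    return data


simulated = simulate_one_way_model(
    mu=8.0,
    treatment_effects=[0.0, 1.0, -0.5],
    sigma=0.3,
    n_per_treatment=4,
)

for treatment, values in simulated.items():
    sample_mean = sum(values) / len(values)
    print("Treatment", treatment, "sample mean:", round(sample_mean, 2))
```

Under a model like this, an observed data set is treated as one realization of the mechanism, and the analysis tries to learn about the unknown mean and treatment effects from it.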
2/14/
Matrix Introduction to Matrix Algebra Useful for Statistics
Peng Liu 2/19/2008 A matrix is a rectangular array of elements arranged in rows and columns. An example:
Column 1 2 3 4
1
Row 1 1 2 3 4 A= R