Week 1: Review of statistics concepts
Before we can begin to learn about regression, we need to review a few important and
fundamental statistical results that will be used throughout the course.
Sample vs population
Repeated samples and properties of e

Shuhang Chen sc3880
Lab: Thursday 1:00-2:40
Homework # 4.1-4.2(Real Data)
1)
(a) I selected to use the dataset National Education Longitudinal Survey of 1988 (NELS88).
(b) A two-stage stratified probability sampling design was used.
The first stage sample

HUMD5122 Homework#3.3(Non-linearity)
Shuhang Chen sc3880
Lab: Thursday 1:00-2:40
1) Choosing the Prestige.sav data and select prestige as the outcome variable
(Y), income as the explanatory variable (X1), education as (X2).
(a) Y=0+1X1+2X2+3X1 ^2+4X2^2+5X

Homework#2.2 (Multiple regression)
Shuhang Chen sc3880@tc.columbia.edu
Lab Section: Thursday 1:00-2:40 Lauren
1). I choose Hours as X1, Statmood as X2, and Exam1 as Y in the Students.sav Doc.
(a) Y =A+ B1X1+B2X2+e
B1 is the effect of a 1-unit increase in

Homework#5.2
Shuhang Chen sc3880
Lab: Thursday 1:00-2:40
1) Choosing the Prestige.sav dataset and select prestige as the outcome variable
(Y), income as the explanatory variable (X1), education as (X2). Prestige
=a+1income+2Education+
Y=a+1X1+2X2+
(a)
T

Homework#5.1 (Outliers, leverage, influence)
Shuhang Chen sc3880
Lab: Thursday 1:00-2:40
1) Choosing the Prestige.sav dataset and select prestige as the outcome
variable (Y), income as the explanatory variable (X1), education as (X2).
Prestige =a+1income+