Midterm 1 Solutions
Thursday 10/1/09
70 min. Closed book. Calculator allowed. Formula sheets and tables provided.
36-401 HOMEWORK 9 SOLUTIONS
December 10, 2009
1. Missingness EDA
(a) Our data matrix is of size 522 × 11, and there are 185 missing entries, over 3% of the data. Of
the 522 observations, 170 have at leas
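The counts quoted above (total missing entries, share of the data missing, and observations with at least one missing value) can be obtained with a few base R calls. The sketch below is illustrative only: the homework data set is not available here, so it uses R's built-in `airquality` data frame as an assumed stand-in, and the variable names are hypothetical.

```r
# Missingness summary on a stand-in data set (airquality is an assumption,
# not the 522 x 11 data matrix from the homework).
dat <- airquality

n_missing         <- sum(is.na(dat))           # total number of missing entries
frac_missing      <- mean(is.na(dat))          # share of all entries that are missing
n_incomplete_rows <- sum(!complete.cases(dat)) # observations with >= 1 missing value
missing_by_col    <- colSums(is.na(dat))       # missing entries per variable

print(dim(dat))
print(n_missing)
print(round(100 * frac_missing, 1))
print(n_incomplete_rows)
print(missing_by_col)
```

Replacing `dat` with the actual data frame gives the corresponding figures for the homework data.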
Data Analysis Exam 1
This exam is a week-long take-home data analysis exam. You are allowed to use your textbook
as well as other reference books you feel
Some Review Problems for Multivariate Linear Regression
36-401 Formula Sheet Midterm 2
Some standard errors:
\[ s^2\{b\} = MSE \, (X'X)^{-1} \]
\[ s^2\{\hat{Y}_h\} = MSE \, X_h'(X'X)^{-1}X_h \]
\[ s^2\{\hat{Y}_{h(new)}\} = MSE \left( 1 + X_h'(X'X)^{-1}X_h \right) \]
Some model criteria:
\[ C_p = \frac{SSE_p}{MSE(full)} - n + 2p \]
PRESS_p = (
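As a rough check on the standard-error formulas and the C_p criterion on this sheet, they can be computed by hand in R and compared with what `lm()` reports. This is a minimal sketch under stated assumptions: the built-in `mtcars` data, the choice of predictors, and the new point `x_h` are all hypothetical and are not taken from the course materials.

```r
# Hand computation of the formula-sheet quantities on mtcars (assumed example data).
fit_full <- lm(mpg ~ wt + hp + qsec, data = mtcars)  # "full" model for MSE(full)
fit_sub  <- lm(mpg ~ wt + hp, data = mtcars)         # candidate model with p parameters

X <- model.matrix(fit_sub)        # design matrix, including the intercept column
n <- nrow(X)
p <- ncol(X)
sse_p   <- sum(resid(fit_sub)^2)  # SSE_p
mse     <- sse_p / (n - p)        # MSE of the candidate model
XtX_inv <- solve(t(X) %*% X)      # (X'X)^{-1}

s2_b <- mse * XtX_inv             # s^2{b}: estimated covariance matrix of b

x_h  <- c(1, 3.0, 110)            # hypothetical X_h: intercept, wt = 3.0, hp = 110
quad <- drop(t(x_h) %*% XtX_inv %*% x_h)
s2_yhat <- mse * quad             # s^2{Yhat_h}
s2_pred <- mse * (1 + quad)       # s^2{Yhat_h(new)}

mse_full <- sum(resid(fit_full)^2) / df.residual(fit_full)
cp <- sse_p / mse_full - n + 2 * p  # Mallows' C_p

print(sqrt(diag(s2_b)))
print(c(s2_yhat = s2_yhat, s2_pred = s2_pred, Cp = cp))
```

The diagonal of `s2_b` should agree with the squared standard errors in `summary(fit_sub)`, and `s2_yhat` with the squared `se.fit` returned by `predict()`.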
36-401 Midterm 2 Solutions
Thursday 11/12/09
75 min. Closed book. Calculator allowed. Formula sheets and tables provided.
Data Analysis Exam 2
This exam is a week-long take-home data analysis exam. You are allowed to use your textbook
as well as other reference books you fe
Some Review Problems to Work On
We'll use the cars data set. These data were recorded in the 1920s and were used to examine the
relationship between how fast a car was going and how much distance it ne
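For readers who want to follow along, the sketch below loads the `cars` data and fits a simple linear regression of stopping distance on speed. It assumes the review problems refer to the `cars` data frame that ships with R (columns `speed` and `dist`); the object name `fit_cars` is hypothetical.

```r
# Simple linear regression on R's built-in cars data (assumed to be the data set meant here).
data(cars)
str(cars)                                  # two numeric columns: speed and dist

fit_cars <- lm(dist ~ speed, data = cars)  # distance as a function of speed
print(coef(fit_cars))                      # intercept and slope estimates
print(summary(fit_cars)$r.squared)         # proportion of variance explained
```

A scatterplot with the fitted line (`plot(dist ~ speed, data = cars); abline(fit_cars)`) is a natural next step before working through the problems.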
Data Analysis Project 1
Introduction
The file mobility.csv covers 729 communities and contains a range of predictor variables that can be used
to predict economic mobility.
mobility <
36-401 HW7
Megan Barlow
November 4, 2015
1. Recall the dataset mobility from the first DAP. In this problem, we will still predict economic
mobility (Yi) from the proportion of people with short commu
36-401 HW5
Megan Barlow
October 20, 2015
1. The file gpa.txt contains a bivariate dataset for a sample of 120 students at a small college. The
variables are, in order,
GPA = grade point average at
Data Analysis Project 2
Predicting Housing Prices
The topic of interest for this research group is how various variables interact with each other to impact
housing prices. With access to data collecte
36-401 Homework 1
Megan Barlow
September 9, 2015
(3) Translating math into R (12 total; 1 pt each) Give an R expression which corresponds to each of these
mathematical formulas. Say whether it con-
401_HW4
Megan Barlow
September 30, 2015
1.
2. Diagnostics and Transformations
mpg_data <- read.csv("http://www.stat.cmu.edu/~cshalizi/mreg/15/hw/04/auto-mpg.csv")
1. (3pts) Someone argues that a linear
Data Analysis Project 2
Introduction: Predicting Bike Rentals
The topic of interest for this research project is the daily level of bike rentals based on environmental and
seasonal variables. A bike r
What Factors Help Predict Awarded Damages in Court Cases?
Introduction:
In order to better understand the civil trial system in the United States, particularly the amount of damages
awarded, the Civil
Data Analysis Final Exam
This exam is a week-long take-home data analysis exam. You are allowed to use your textbook
as wel
36-401 HOMEWORK 3
Due: Thursday 9/17/09 at 3pm
36-401 HOMEWORK 9
Factors Associated with Hospital-Acquired Infections
Introduction:
The primary objective of the Study on the Efficacy of Nosocomial Infection Control was to analyze
whether or not infection surveillance and
Data Analysis 2 Sample Report
Introduction
Using a subset of the data acquired in 1975 and 1976 for the Study of the Efficacy of Nosocomial Infection
Control (SENIC), we study the relationship between pr
36-401 - Homework 8 - Solutions
1. Problem 6.5 b-d
b Estimated Regression Function
First I perform the lm() command in R
> data=read.table("CH06PR05.txt")
> summary(lm(d
36-401 HOMEWORK 8
36-401 HOMEWORK 7 SOLUTIONS
November 2, 2009
1. Textbook 7.5
(a) \(SSR(X_2)\), \(SSR(X_1 \mid X_2)\), \(SSR(X_3 \mid X_1, X_2)\)
To compute this we re-run the regression analysis in the order needed and create the ANOVA
ta
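The idea of re-running the regression in the order needed can be sketched as follows: `anova()` on an `lm` fit reports sequential (Type I) sums of squares, so entering the predictors as X2, X1, X3 yields SSR(X2), SSR(X1 | X2), and SSR(X3 | X1, X2) row by row. The textbook data are not available here, so the built-in `mtcars` data serve as an assumed stand-in, with hp, wt, and qsec playing the roles of X2, X1, and X3.

```r
# Extra sums of squares via sequential ANOVA (mtcars is a stand-in, not the textbook data).
# Roles assumed for illustration: X1 = wt, X2 = hp, X3 = qsec.
fit_seq <- lm(mpg ~ hp + wt + qsec, data = mtcars)  # order: X2, then X1, then X3
tab <- anova(fit_seq)                               # sequential sums of squares

seq_ss <- setNames(tab[["Sum Sq"]][1:3],
                   c("SSR(X2)", "SSR(X1 | X2)", "SSR(X3 | X1, X2)"))
print(seq_ss)

# The sequential pieces add up to the regression sum of squares of the full model.
ssr_total <- sum((fitted(fit_seq) - mean(mtcars$mpg))^2)
print(c(sum_of_pieces = sum(seq_ss), ssr_total = ssr_total))
```

Changing the order of terms in the formula changes the decomposition, which is why the fit has to be repeated for each ordering a problem asks about.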
36-401 - Homework 6 - Solutions
1. 6.15
(a) EDA: Satisfaction is roughly symmetric with a mean of 61.57 and goes from 26 to 92. Age is
[Figure: Histogram of Age; y-axis: Frequency]
36-401 HOMEWORK 7
36-401 HOMEWORK 6
