Statistics 5021 – Homework 7
There are 20 total points. This homework is due Thursday, April 28 in your lab section.
In this homework, you will analyze datasets (preferably using R).
1. Two numerical characteristics were measured for 18 countries. The first characteristic,
mortality, is the mortality rate from heart disease per thousand, and the second
characteristic, consumption, is the average per capita consumption of wine in liters.
These data are available for download on Moodle in the file “wine.txt”. Our interest is
to relate mortality to consumption using the simple linear regression model, where
mortality is the response and consumption is the predictor.
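For reference, the simple linear regression model named above can be written as in the sketch below. The symbols \beta_0, \beta_1, \varepsilon_i and \sigma^2 are notation assumed here rather than taken from the assignment, and the normal-error assumption is the usual textbook formulation, not something the problem states.

```latex
\[
\text{mortality}_i = \beta_0 + \beta_1\,\text{consumption}_i + \varepsilon_i,
\qquad \varepsilon_i \overset{\text{iid}}{\sim} N(0, \sigma^2),
\quad i = 1, \dots, 18.
\]
```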
(a) Produce a scatter plot of the response versus the predictor. Based on this scatter
plot, would a simple linear regression model be appropriate?
(b) Instead of predicting mortality with consumption, consider predicting log(mortality)
with log(consumption), where log() is the natural logarithm. Assuming that we
read the dataset in as:
wine = read.table("wine.txt")
