Statistics 104 – Spring 2011
Section #9

Topics for Section
- Inference for Regression
- Omnibus F-test, t-test for a specific β
- Interpretation of coefficients
- Model selection

Practice Problems

An investigator was interested to see if one can predict how a state voted in the 2008 presidential election. She thinks the following attributes might be related to how a state voted.

Y  = McCain vote percentage
X1 = % college graduates
X2 = per capita Gross State Product (GSP)
X3 = % smokers
X4 = % heavy drinkers

The investigator collects data from all 50 states. She first looks at correlations between all variables.

. corr mccain college gsp smokers drinkers
(obs=50)

             | mccain~e  college      gsp  smokers drinkers
-------------+---------------------------------------------
  mccainvote |   1.0000
     college |  -0.6113   1.0000
         gsp |  -0.3054   0.6236   1.0000
     smokers |   0.3310  -0.6382  -0.2696   1.0000
    drinkers |  -0.5829   0.3845   0.2783  -0.3299   1.0000

a. Which of the X variables will have the most significant simple linear regression fit with McCain voting percentage?

The investigator then looks at all the simple regressions, listed at the end of the handout.

b. Which variable seems to be the strongest predictor of McCain voting percentage? Why?
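
A relationship worth recalling for part (a): in a simple linear regression with one predictor, the t-statistic for the slope depends only on the sample correlation r and the sample size n, through t = r * sqrt(n - 2) / sqrt(1 - r^2), so a larger |r| gives a larger |t|. The sketch below is only an illustration in Python (the handout's own output comes from Stata); it plugs in the correlations with McCain vote percentage from the table above and n = 50. The names `r_with_mccain` and `slope_t_stat` are made up for this example.

```python
import math

n = 50  # number of states (obs=50 in the correlation output)

# Correlation of each predictor with McCain vote percentage,
# copied from the first column of the correlation table.
r_with_mccain = {
    "college": -0.6113,
    "gsp": -0.3054,
    "smokers": 0.3310,
    "drinkers": -0.5829,
}


def slope_t_stat(r, n):
    """t-statistic for the slope in a simple linear regression,
    written in terms of the sample correlation r and sample size n."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r**2)


t_stats = {name: slope_t_stat(r, n) for name, r in r_with_mccain.items()}

# List predictors from largest to smallest |t|.
for name, t in sorted(t_stats.items(), key=lambda kv: -abs(kv[1])):
    print(f"{name:>8}: r = {r_with_mccain[name]:+.4f}, t = {t:+.2f}")
```

These values can be checked against the slope t-statistics in the simple regression output at the end of the handout; they should agree up to rounding of the correlations.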

