This preview shows pages 1–3. Sign up to view the full content.
This preview has intentionally blurred sections. Sign up to view the full version.
View Full Document
Unformatted text preview: STAT3503 Assignment # 2 Question 1. ( Use SAS to solve this question ) A criminologist studying the relationship between level of education and crime rate in mediumsize U.S. counties collected the following data for a random sample of 84 counties; X is the percentage of individuals in the county having at least a highschool diploma, and Y is the crime rate(crimes reported per 100,000 residents) last year. Assume that firstorder regression model is appropriate. [1] (a) Obtain the estimated regression function. [1] (b) Test whether or not there is a linear association between crime rate and percentage of high school graduates, using a t test with = 0 . 01. State the alternatives, decision rule, and conclusion. What is the Pvalue of the test? [1] (c) Set up the ANOVA table. [2] (d) Carry out the test in part (b) by means of the F test. Show the numerical equivalence of the two test statistics and decision rules, explain it. Is the Pvalue for the F test the same as that for the t test, why? [1] (e) Obtain R 2 and r. [1] (f) Obtain the residuals e i and prepare a box plot of the residuals. What information is provided by your plot. [1] (g) Plot the residuals e i against the fitted values X i to ascertain whether any departures from the simple linear regression model. State your findings. [1] (h) Prepare a normal probability plot(QQ plot) of the residuals. Does the normality assumption appear to be reasonable here? [2] (i) Use the BrownForsythe test to determine whether or not the error variance varies with the level of X. Divide the data into the two groups, X 69 , X &gt; 69, and use = 0 . 05. State the decision rule and conclusion. Does your conclusion support your preliminary findings in part (g)? SOLUTION (a) b Y = 20518 170 . 58 X (b) H : 1 = 0 , H : 1 = 0. 1 Since s ( 1 ) = 41 . 5743, t * = 170 . 575 41 . 5743 = 4 . 1029 , because  t *  &gt; t (0 . 995;82) = 2 . 63712. Conclude H . Pvalue = Pr (  T  &gt; t * ) = 0 . 000096. (c)The ANOVA tale is Source DF Sum of Squares Mean Square Model 1 93462942 93462942 Error 82 455273165 5552112 Total 83 548736108 (d) H : 1 = 0 , H : 1 6 = 0....
View
Full
Document
This note was uploaded on 09/30/2010 for the course STAT 3503 taught by Professor Smills during the Spring '07 term at Carleton CA.
 Spring '07
 smills

Click to edit the document details