Final Exam Practice Solutions: ANOVA and Logistic Regression

ANOVA

(1) ANOVA Basics:

(a) The completed ANOVA table is shown below. The degrees of freedom must add, so the within-group degrees of freedom is 31 - 3 = 28. We know that mean squares are sums of squares divided by degrees of freedom. Since SSB/3 = 16.42 for the between groups, SSB = (3)(16.42) = 49.26. Similarly, MSW = SSW/DF = 102.5/28 = 3.66. We also know that the sums of squares add, so SST = SSB + SSW = 49.26 + 102.5 = 151.76. Finally, F = MSB/MSW = 16.42/3.66 = 4.49.

Source                 df      SS       MS       F
Between (Treatment)     3     49.26    16.42    4.49
Within (Error)         28    102.50     3.66
Total                  31    151.76

(b) We know that the degrees of freedom between groups is just the number of groups minus 1. This comes from the fact that in a regression we would only need G-1 indicator variables to cover all the groups, with the remaining group as the reference. Since G-1 = 3, there are G = 4 treatment groups here. Note that the term “treatment” refers to the group label. This terminology comes from the fact that in medical studies the groups often represent the different treatments or drug dosages that are given to patients and whose efficacy the study seeks to compare. However, a treatment could also be something like gender or smoking status.

(c) It is possible that the ANOVA table comes from a balanced randomized design. A balanced design would have the same number of points in each group. Here there are 4 groups and a total of n = 32 data points. The latter we know because the total degrees of freedom was 31 in the ANOVA table, and we know that this is just n-1. With 32 points and 4 groups we could have 8 points per group, which is a balanced design. Of course, the design could have been unbalanced (we can’t tell for sure from just the table), and we know nothing at all about whether it was really randomized.

(d) The p-value we want is P(F_{3,28} ≥ 4.49). From the F table in your text (which you are NOT responsible for using) the closest available degrees of freedom are 3 and 30.
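As a cross-check that does not depend on a printed table, the tail probability P(F_{3,28} ≥ 4.49) can also be evaluated numerically. The Python sketch below is only an illustration, not part of the original solution: the function name f_tail and the choice of Simpson's rule are assumptions made here. It relies on the standard identity P(F ≥ f) = I_x(df2/2, df1/2) with x = df2/(df2 + df1·f), where I_x is the regularized incomplete beta function, and it assumes f > 0 and df2 ≥ 2 so that the integrand stays bounded.

```python
import math


def f_tail(f_stat, df1, df2, n=20000):
    """Approximate P(F >= f_stat) for an F(df1, df2) random variable.

    Hypothetical helper for illustration. Assumes f_stat > 0, df2 >= 2,
    and an even number of Simpson subintervals n.
    """
    a = df2 / 2.0  # first beta parameter
    b = df1 / 2.0  # second beta parameter
    x = df2 / (df2 + df1 * f_stat)  # upper limit of integration, in (0, 1)

    def integrand(t):
        return t ** (a - 1.0) * (1.0 - t) ** (b - 1.0)

    # Composite Simpson's rule on [0, x].
    h = x / n
    total = integrand(0.0) + integrand(x)
    for i in range(1, n):
        weight = 4 if i % 2 else 2
        total += weight * integrand(i * h)
    integral = total * h / 3.0

    # Complete beta function B(a, b), computed via log-gamma for stability.
    log_beta = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    return integral / math.exp(log_beta)


if __name__ == "__main__":
    # Degrees of freedom and F statistic taken from the ANOVA table in part (a).
    print(f_tail(4.49, 3, 28))
```

A value near .01 from this sketch would agree with what the table lookup suggests.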
We see that F_{3,30,.99} = 4.51, which is almost exactly our value, so we conclude the p-value is very close to .01. Using a significance level of α = .05 we would reject H_0 and conclude that at least some of the means in this problem (whatever it is about!) are different. If we wanted to use STATA to get the exact p-value we would type

    di Ftail(3,28,4.49)

The “di” stands for display. “Ftail” tells STATA to give us the probability of being bigger than the value indicated. The first two numbers in parentheses are the degrees of freedom and the final number is the F statistic. Note that this approach is better than the one I originally posted with t