20 Pages

30_TRG_Chapter23

Course: STAT 125, Spring 2008
School: Stanford
Rating:
 
 
 
 
 

Word Count: 5654

Document Preview

VI 23-1 Part : Learning About the World : Chapters 23 25 Now that we have the principles of inference under our belts, its an easy slide downhill to the finish line. The mantra of this Part is just like the other ones, only a little different. Emphasize that, and students wont feel overwhelmed thinking there are new formulas and methods raining down on them with each chapter. If you havent done it yet, point...

Register Now

Unformatted Document Excerpt

Coursehero >> California >> Stanford >> STAT 125

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
VI 23-1 Part : Learning About the World : Chapters 23 25 Now that we have the principles of inference under our belts, its an easy slide downhill to the finish line. The mantra of this Part is just like the other ones, only a little different. Emphasize that, and students wont feel overwhelmed thinking there are new formulas and methods raining down on them with each chapter. If you havent done it yet, point your students to the inside back cover of this book. The table there organizes all of the inference methods and shows them that they are really all alike only a bit different. And the list of assumptions and conditions should remind students of the things they were looking for in their displays and summaries back in Parts I and II. Chapter 23 Inferences for Means Whats It About? Chapter 23 examines confidence intervals and hypothesis tests for a population mean. We introduce the t-models to deal with the extra variation caused by not knowing the populations standard deviation, and examine the assumptions and conditions necessary for inference. We use this new application of the concepts of inference to re-examine the correct interpretation of a confidence interval, the relationship between confidence intervals and hypothesis tests, and the difference between statistical significance and importance. We show how to determine the sample size necessary for a study about means, and how to interpret output from software packages. TI Tips help students explore the new t-models and do the mechanics for confidence intervals and hypothesis tests about a mean. Comments We almost never do inference pretending to know the population standard deviation. We have emphasized throughout this course that Statistics is about the real world, and we wont change that now. In the real world no one ever knows the population standard deviation. (Well, almost never. There are cases when experience suggests that the standard deviation is known the hospital example in the book is a case in point. In that rare case, we can use the z instead). Gosset discovered the t-distributions to resolve this dilemma. What worked for Gosset works for us, too. And we never mention the outdated pretechnology approach of using a Normal model for large samples. You might think there was a compelling mathematical justification for switching over to z when the sample size exceeds 30, but there wasnt. Typesetters, not mathematicians, are the source of that rule of thumb. About 30 lines of a t-table fit conveniently on a page. After that, the old advice was, turn to the z-table because its pretty close. In the age of graphing calculators and statistics software, theres no reason to shy away from t for any number of degrees of freedom, so we can do better than approximate it with z. We can do inference for means the right way, using t. Always. The bottom line is that nothing else in this chapter is really new. We still create and interpret confidence intervals the same old way. We still use the same reasoning and four-step procedure for testing a hypothesis. The only new issue is the need to scrap the trusty Normal model in favor of a t-model, an adjustment students will readily make. Copyright 2010 Pearson Education, Inc. 23-2 Part VI Learning About the World Weve been very careful about the term standard error, and here we get some of the benefit. The standard error SE (y ) = s is a standard error because it is estimated from the data. And, of n course, that fact is central to our use of Students t. Remember also that t-models are just that models. Students most common confusion at this stage is between the standard deviation of the data, s, and the standard error we estimate for the mean, SE (y ). They need to be able to distinguish between the actual distribution of the data on the one hand, and the model for the sampling distribution of the mean on the other. Talk about the model often and clearly to help them keep that distinction clear. With the t-models we can show a simpler, clearer relationship between confidence intervals and hypothesis tests. Students should see these as two ways of viewing essentially the same calculations. That reduces by half how much new stuff there is here to learn. Looking Ahead In the next two chapters we expand our abilities to do inference for means when there are two groups. Chapter 24 explains the proper procedures if the two groups are independent, and Chapter 25 looks at inference for paired differences if the groups are matched. Both chapters continue to rely on t-models. Students t returns again in Chapter 27 where we discuss inference for regression. Class Dos Start right in with an example involving data like our coffee machine suggestion. The chapter discusses confidence intervals first, then moves on to hypothesis tests. You should look at the two types of inference in that order if you plan to spend two days introducing them. Its also possible that you can do both in one day a hypothesis test with a follow-up confidence interval. Students will be able to set up the outline of the hypothesis test. After all, theyre all the same. Hypotheses. This time theyre about . (Never, y of course. We know y ; that value comes to us from the sample. Its the mean of the population we wonder about.) Model. Some of the conditions are obvious. We need a simple random sample of less than 10% of the population (and we wont worry as much about the 10% Condition for quantitative data). Well add another condition after some more discussion. Mechanics. It seems pretty straightforward at first. Find y , then try to find z. Trouble. The Central Limit Theorem tells us the sampling model for y has a standard deviation of divided by the square root of the sample size. Strike One: we dont know . Students suggest using s, the samples standard deviation, as an estimate. Great idea. This shows they have caught on to the kind of statistical thinking we want them to develop in this course. But theres a problem. Unlike proportions, where the null hypothesis specifies p, and that in turn tells us the SD, here there is no link to the SD. The hypothesized population mean tells us nothing about the populations standard deviation. And the sample standard deviation, s, varies from sample to sample. This introduces variation in the denominator, something that wasnt there before. How can we deal with this extra uncertainty? Some students may suggest cutting ourselves a little more slack to make up for the increased variability, and thats exactly what a t-model does. Copyright 2010 Pearson Education, Inc. Chapter 23 Inferences About Means 23-3 Spend some time briefly describing t (suggestions follow). Note that the mathematics underlying the use of t requires the assumption that the sample comes from a population that is Normal. Strike Two: we dont know the populations center (our hypothesis is about that), we dont know its spread (thats why we need t), and we dont (indeed cant) know its shape, either. All we can do is take a look at the distribution of the data in the sample to see if its plausible that these data were chosen at random from a Normal population. We hope to see a roughly mound-shaped, symmetric distribution without skewness or outliers. Thats the Nearly Normal Condition, telling us its OK to use a one-sample t-test for means. With the conditions now complete, get right on with the hypothesis test. Calculate t, draw the curve (indicating the degrees of freedom), shade the region representing the P-value, and find P (using technology or a t table for critical values). Conclusion. Its the same old story: linkage, decision, and conclusion in context. Be sure the conclusion talks about the mean of the population. Its a great idea to follow the hypothesis test with a confidence interval, looking up the critical value in the table. Spend some time with the interpretation. There are all kinds of new ways to be wrong about what the interval means! Most common are that some students will think that 95% of the population or 95% of all sample means must lie in the interval. Help students understand the correct interpretation. The Investigative Task on means will help. Since the t-models are new, it helps to discuss their relationship to the Normal model. Common sense says that the increased uncertainty caused by using s to estimate makes this model more spread out, and that this uncertainty (and therefore the spread) must be greater for small samples. These ideas tell us that there are different t-models for different ns, pointing to the new issue of degrees of freedom. Greater spread tells us that confidence intervals must be wider, so critical values of t must be larger than the corresponding critical value of z. A quick glance down one of the columns of the table will show that, and also show that t-models look increasingly Normal as n increases. The Nearly Normal Condition is intentionally vague. First, be sure students understand that the issue is not whether the sample is Normal. Samples arent; they cant be. We must have a sample drawn from a population thats Normal, but we can never be sure about the population. The t-test is fairly robust. What we really need is that the distribution is roughly symmetric i.e., nearly Normal. Were particularly concerned about outliers and skewness. However, the usefulness of a tmodel also depends on the sample size. As the sample size increases, the Normality of the population becomes less critical. (We say that the t-models with larger degrees of freedom are increasingly robust with respect to the assumption of Normality.) For ns near 20, some skewness is okay. For ns near 30, we can use t unless the sample is strongly skewed or there are far outliers. For larger samples, we often dont even look at the histogram, justifying use of a t-model simply by saying the sample is large. Talk about what to do with outliers. We recommend performing the analysis twice. The concern, of course, is that the outlier will distort the results. If the conclusions dont change (if, for example, we reject a hypothesis when we include the outlier in the data, and also reject it after we remove the outlier), then its clear that the decision is not based on the impact of the outlier. Generally, outliers inflate the standard error more than they move the sample mean, so they make confidence intervals wider and P-values larger. Copyright 2010 Pearson Education, Inc. 23-4 Part VI Learning About the World We resort to z only when we do a sample size calculation, because not knowing n leaves us unable to pick a critical value for t. This calculation will always underestimate the sample size we should use. If it calls for a very large sample, thats fine. But if it points to a small sample, then we need to use that first estimate for n to determine a critical t and rerun the calculation. The Importance of What You Dont Say Dont suppose we know , the standard deviation of the population. We (almost) never will. Dont suggest that we use z instead of t for large samples. For inference about means, use t. Always. (Almost.) Dont confuse notation or terminology. We find it confusing to write y = / n . Instead use SD( y ). Of course its the same thing, but students seem to get confused with s all over the place. And the difference between SD( y ) and SE( y ) reminds them of the difference in terminology and meaning. A Calculator Note Newer TI-84 operating systems allow students to find critical t values with InvT. Students with TI83s will need to use the table. (Table T) Or they can trick the TI into doing the same thing. Suppose you want a 95% confidence interval for 10 df. Use 8:Tinterval. Enter x : 0, Sx: (11) , n:11, and C-level:.95. The calculated confidence interval will be the critical values of t. (You may or may not want to show this to your students. After all, if they were going to use the TI instead of the table to get the critical value, why not just have it do the interval?) Class Examples 1. A coffee machine dispenses coffee into paper cups. Youre supposed to get 10 ounces of coffee, but the amount varies slightly from cup to cup. Here are the amounts measured in a random sample of 20 cups. Is there evidence that the machine is shortchanging customers? 9.9 9.9 10.0 9.9 10.0 Hypotheses. H0 : = 10.0 9.7 9.6 9.5 9.6 9.9 10.0 9.8 9.7 10.2 9.5 10.1 9.8 10.1 9.8 9.9 HA : < 10.0 Model. Random sample; 20 < 10% of all cups (no reason to doubt independence). The histogram of sample data to the right looks roughly unimodal and symmetric, so its reasonable to believe that the amount of coffee in all possible cups could be described by a Normal model. Okay to do a t-test for the mean. Mechanics. n = 20 df = 19 y = 9.845 s = 0.199 9.845 10 t= = 3.49 0.199 20 P = P(t19 < 3.49) = 0.0012 Copyright 2010 Pearson Education, Inc. Chapter 23 Inferences About Means 23-5 Conclusion. Such a small P-value makes it unlikely that the low sample mean resulted from sampling error, so we reject the null hypothesis. There is strong evidence that the mean amount of coffee dispensed by this machine is less than the stated 10 fluid ounces. Confidence interval. The conditions have been met, so we can create a one-sample t-interval. (Note that all confidence intervals look alike: estimate margin of error.) * y t19 SE ( y ) = 9.845 2.093 0.199 = (9.75,9.94) 20 We are 95% confident that the machine dispenses an average of between 9.75 and 9.94 fluid ounces of coffee per cup. 2. A company has set a goal of developing a battery that lasts over 5 hours (300 minutes) in continuous use. A first test of 12 of these batteries measured the following lifespans (in minutes): 321, 295, 332, 351, 281, 336, 311, 253, 270, 326, 311, and 288. Is there evidence that the company has met its goal? Solution: We want to know if the mean battery lifespan exceeds the 300-minute goal set by the manufacturer. We have 12 battery lifespans in our sample to test the claim. Hypotheses. The null hypothesis is that the batteries have a mean lifespan of 300 minutes. The 300-minute goal has not been met. The alternative hypothesis is that the batteries have a mean lifespan greater than 300 minutes. The 300-minute goal has been met. H 0 : = 300, H A : > 300 Model. 4 Randomization Condition: This is not a random sample of batteries, but merely 12 batteries produced for preliminary 3 testing. However, it is reasonable to assume that these 2 batteries are representative of all batteries. 1 Nearly Normal Condition: The distribution of battery lifespans is roughly unimodal and symmetric, so its 240 280 320 reasonable to assume that the lifespans of all batteries could Battery Lifespans (min) be described by a Normal model. Since the conditions have been met, we can do a one sample t-test for the mean, with 11 degrees of freedom. Mechanics. t= n = 12 y s n 306.25 300 t= 29.31 df = 11 y = 306.25 P-value = P( y > 306.25) = P(t11 > 0.7387) = 0.238 12 t 0.7387 Copyright 2010 Pearson Education, Inc. s = 29.31 23-6 Part VI Learning About the World Conclusion. Since the P-value is high, we fail to reject the null hypothesis. There is no evidence to suggest that the mean battery lifespan exceeds 300 minutes. It does not appear that the company has met its goal. Find a 90% confidence interval for the mean lifespan of this type of battery. Solution. The conditions have been met, so we can create a one-sample t-interval, with 90% confidence. 29.31 * y t11 SE ( y ) = 306.25 1.796 = (291.05,321.45) 12 I am 90% confident that the mean battery lifespan is between 291.05 and 321.45 minutes. If we wish to conduct another trial, how many batteries must we test to be 95% sure of estimating the mean lifespan to within 15 minutes? To within 5 minutes? Solution: We want to know how many batteries to test to be 95% sure of estimating the mean lifespan to within 15 minutes. First, do a preliminary estimate using z * = 1.96 as the critical value. Our first estimate is about 15 batteries, a small sample. s ME = z * n 29.31 15 = 1.96 n (1.96)(29.31) 15 n 14.67 n= * Now, do a better estimate, using t14 = 2.145 as the critical value. s ME = t * n 29.31 15 = 2.145 n We would need to sample about 18 batteries in order to estimate the mean battery lifespan to within 15 minutes, with 95% confidence. (2.145)(29.31) 15 n 17.56 n= Finally, to estimate the mean battery lifespan to within 5 minutes, you could do the entire process again, perhaps using a critical value with much higher degrees of freedom. We know that its going to take lots more batteries to cut the margin of error to a third of what it was. Alternatively, we know it will take a sample about 9 times as large, 18(9) = 162 batteries, since the margin of error was decreased to a third of its size. (The standard error involves the square root of the sample size in its denominator.) Copyright 2010 Pearson Education, Inc. Chapter 23 Inferences About Means 23-7 3. A few days after you finish this chapter you will be collecting (and then returning) the Tasks. Each student will have created a confidence interval based on a different sample. Tell them what the true mean for the population was. (If you used our data, the mean Math SAT-I score was 611.32.) Ask them how many caught it in their interval. Then ask who missed it. There will probably be a few whose intervals do not contain the true mean yet they did nothing wrong. This teachable moment really helps students understand what 95% confidence means, and why we are so picky about those pesky interpretations. Resources ActivStats Chapter 23. The interactive tool in ActivStats that shows confidence intervals and P-values dynamically is valuable in cementing student understanding. One activity in this chapter differentiates the effects of smaller sample size on degrees of freedom from the effects of sample size on the standard error (in which we divide by the sample size). The change in standard error has a far greater impact than the spreading of the t-distribution with smaller degrees of freedom. TI-Nspire Demonstrations The t-models o Understand the t-models. o See how t-models change as you change the degrees of freedom. o Compare t-models to the Normal model. Confidence Intervals for Means o Generate confidence intervals from many samples how to see often they successfully capture the true mean. Decisions Through Data Video Unit 20: Confidence Intervals done the old-fashioned way, unfortunately, assuming we know the populations standard deviation. There are some nice parts of this video clip, but dont show it if you fear the approach will confuse your students. Video Unit 21: Tests of Significance Workshop Statistics Topic 20: Confidence Intervals II: Means Topic 22: Tests of Significance II: Means Topic 23: More Inference Considerations Copyright 2010 Pearson Education, Inc. 23-8 Part VI Learning About the World Web Links Youll find several good data sets at the DASL Web site. (lib.stat.cmu.edu/DASL) Several Web sites have confidence interval applets, useful for demonstrating the key ideas that the population parameter is fixed while confidence intervals vary from sample to sample, capturing the population value most but not all of the time. Assignments Two days of reading and three assignments of 56 exercises each will suffice for this chapter. Over three days you can do a couple of examples in class and go over homework questions (with time to look at the last unit test). Students will then be ready to tackle the Task as you move on into Chapter 24. Three chapter quizzes are provided. Investigative Task This is the first of a pair of Tasks that use the same data set. The thrust of the Tasks is more important than the particular data you use. We use the Math and Verbal SAT scores reported for one high school, but other data will work. You may want to find something more relevant to your students and revise the Task accordingly. Ideally you need a reasonably large but manageable data set; 250 to 500 cases would work well. Math and Verbal SATs (or ACTs) are ideal because each individual has two scores (making paired comparisons possible) and lots of statistics are available. The first part of the Task asks students to select a random sample, reviewing use of random numbers and sampling issues from Chapter 12. Based on the various samples, students construct confidence intervals, and then use their interval to compare the local performance to state or national results. And this affords you the opportunity to look at all their confidence intervals together, noting that most hit the target while others miss. The second in this pair of these Tasks comes after Chapter 25, asking students to do two hypothesis tests one involving paired data and the other comparing two independent means. Have them hang on to the data sheet; theyll need it again. Copyright 2010 Pearson Education, Inc. AP Statistics Quiz A Chapter 23 Name A professor at a large university believes that students take an average of 15 credit hours per term. A random sample of 24 students in her class of 250 students reported the following number of credit hours that they were taking: 12 13 14 14 15 15 15 16 16 16 16 16 17 17 17 18 18 18 18 19 19 19 20 21 1. Does this sample indicate that students are taking more credit hours than the professor believes? Test an appropriate hypothesis and state your conclusion. 2. Find a 95% confidence interval for the number of credit hours taken by the students in the professors class. Interpret your interval. 23-9 Copyright 2010 Pearson Education, Inc. AP Statistics Quiz A Chapter 23 Key 1. H 0 : = 15 . The mean number of credit hours taken by all of 5 the professors students is 15 credit hours. 4 H A : > 15 The mean number of credit hours taken by all of 3 the professors students is greater than 15 credit hours. 2 * Randomization condition: Students from the class were 1 randomly sampled. * 10% condition: The sample is less than 10% of the class 12 15 18 21 population. Credit Hours * Nearly Normal condition: The histogram of credit hours is unimodal and reasonably symmetric. Under these conditions, the sampling distribution of the mean can be modeled by Students t with df = n 1 = 24 1 = 23. Use a one-sample t-test for the mean. We know: n = 24, y = 16.6 , and s = 2.22 . 2.22 So, SE ( y ) = = 0.453 . 24 y 0 16.6 15 = = 3.532 . t= 0.453 SE ( y ) The P-value is P(t23 > 3.532) = 0.0009 . The P-value of 0.0009 is low, so reject the null hypothesis. There is strong evidence that the professors students are taking more than 15 credit hours, on average. 2. With the conditions satisfied (from Problem 1), we can find a t-interval for mean credit hours. We know: n = 24, y = 16.6 , s = 2.22 , and SE ( y ) = Our confidence interval has the form y tn 1 interval is then 16.6 2.069 2.22 = 0.453 . 24 s . We have t23 = 2.069 . Our 95% confidence n 2.22 = 16.6 0.94 , or 15.66 to 17.54. 24 We are 95% confident that the interval 15.66 to 17.54 contains the true mean number of credit hours that students in the professors class are taking. 23-10 Copyright 2010 Pearson Education, Inc. AP Statistics Quiz B Chapter 23 Name Insurance companies track life expectancy information to assist in determining the cost of life insurance policies. The insurance company knows that, last year, the life expectancy of its policyholders was 77 years. They want to know if their clients this year have a longer life expectancy, on average, so the company randomly samples some of the recently paid policies to see if the mean life expectancy of policyholders has increased. The insurance company will only change their premium structure if there is evidence that people who buy their policies are living longer than before. 86 75 83 84 81 77 78 79 79 81 76 85 70 76 79 81 73 74 72 83 1. Does this sample indicate that the insurance company should change its premiums because life expectancy has increased? Test an appropriate hypothesis and state your conclusion. 2. For more accurate cost determination, the insurance companies want to estimate the life expectancy to within one year with 95% confidence. How many randomly selected records would they need to have? 23-11 Copyright 2010 Pearson Education, Inc. AP Statistics Quiz B Chapter 23 Key 1. H 0 : = 77 years . The mean life expectancy of the companys patients is 77 years. H A : > 77 years . The mean life expectancy of the 3 companys patients is greater than 77 years. 2 * Randomization condition: The records from the insurance company were randomly sampled. 1 * 10% Condition: 20 records represent less than 10% of the companys records. 70 74 78 82 86 * Nearly Normal condition: The histogram of the ages at death Age at Death is unimodal and reasonably symmetric. This is close enough to Normal for our purposes. Under these conditions, the sampling distribution of the mean can be modeled by Students t with df = n 1 = 20 1 = 19. We will use a one-sample t-test for the mean. We know: n = 20, y = 78.6 years, and s = 4.48 years. s 4.48 SE ( y ) = = = 1.002 years. n 20 y 0 78.6 77 = = 1.597 . t= SE ( y ) 1.002 P = P(t19 > 1.597) = 0.063 The P value of 0.063 is fairly high, so we fail to reject the null hypothesis. The insurance company shouldnt need to increase their premiums because there is little evidence to indicate that people who buy their policies are living longer than before. 2. We wish to find the sample size, n, that would allow a 95% confidence level for the mean life expectancy of a policy holder from the insurance company to have a margin of error of only one year. First estimate: ME = z * SE ( y ) 4.48 n n = 77.1 78 1 = 1.96 Although not necessary, since 78 is quite large, we could find a * better estimate using t75 = 1.992 , from Table T. * ME = t75 SE ( y ) 4.48 n n = 79.6 80 1 = 1.992 23-12 Copyright 2010 Pearson Education, Inc. AP Statistics Quiz C Chapter 23 Name Textbook authors must be careful that the reading level of their book is appropriate for the target audience. Some methods of assessing reading level require estimating the average word length. Weve randomly chosen 20 words from a randomly selected page in Stats: Modeling the World and counted the number of letters in each word: 5, 5, 2, 11, 1, 5, 3, 8, 5, 4, 7, 2, 9, 4, 8, 10, 4, 5, 6, 6 1. Suppose that our editor was hoping that the book would have a mean word length of 6.5 letters. Does this sample indicate that the authors failed to meet this goal? Test an appropriate hypothesis and state your conclusion. 2. For a more definitive evaluation of reading level the editor wants to estimate the texts mean word length to within 0.5 letters with 98% confidence. How many randomly selected words does she need to use? 23-13 Copyright 2010 Pearson Education, Inc. AP Statistics Quiz C Chapter 23 Key 1. H 0 : = 6.5 . The mean word length in the book is 6.5 words. H A : 6.5 . The mean word length in the book is not 6.5 words. * Randomization Condition: We have a random sample of the 8 words in the book. * 10% Condition: The words are less than 10% of all words in 6 the book. 4 * Nearly Normal Condition: A histogram of the observed 2 word lengths looks roughly unimodal and symmetric, so the population of all word lengths may be approximately Normal. Since the conditions are satisfied, we will to use a two-tail, one-sample t-test. y = 5.5 n = 20 s = 2.685 df = 19 5.5 6.5 t= = 1.67 2.865 20 P = 2 P(t19 < 1.67) = 0.11 0 6 Word Length (letters) Because the P-value is so high we do not reject the null hypothesis. This sample does not provide evidence that the average word length differs from the goal of 6.5 letters. 2. First estimate: ME = z * SE ( y ) 2.685 n n = 156.02 157 0.5 = 2.326 Although not necessary, since 157 is quite large, we could find a * better estimate using t140 = 2.353 , from Table T. * ME = t140 SE ( y ) 2.685 n n = 159.66 160 0.5 = 2.353 23-14 Copyright 2010 Pearson Education, Inc. 12 AP Statistics Investigative Task Chapter 23 SAT Performance This is the first of two related Tasks. In these Tasks you will investigate four questions about students SAT scores at one high school. 1. What is the mean SAT-Math score at this high school? 2. How do these students SAT Math scores stack up against the rest of the state? 3. Is there a significant difference between Verbal and Math scores for these students? 4. Nationally males tend to have higher Math scores than females. Is that true at this high school, too? You have a copy of data from the SAT Score Roster sent to the high school by the College Board. It shows gender, Verbal scores, and Math scores, for 303 students in a recent graduating class. NOTE: You will not use all of these data, just a sample of 20 30 students. Your current assignment: investigate the first two questions posed above. Draw a sample of these students, carefully explaining your procedure. Use your sample to create a 95% confidence interval for the mean SAT-I Math score at this high school. Based on your confidence interval, compare the performance of these students with the statewide mean Math score of 503. Prepare a written report that includes complete demonstrations of the statistical procedures you use and your conclusions (in context, of course). Important: Save the Score Roster and your sample. Youll be using them to answer the other two questions in the next Investigative Task coming soon to a Statistics class near you. 23-15 Copyright 2010 Pearson Education, Inc. AP Statistics Investigative Task Chapter 23 This task has 4 components; each is scored as Essentially correct, Partially correct, or Incorrect. 1. The Sample E Selects a random sample, explains the sampling process clearly, calculates the correct summary statistics (perhaps with minor arithmetic errors), and uses the proper vocabulary and notation. P Selects a random sample, but explanation of the process may be unclear or there may be mistakes in notation, or vocabulary. I Sample is not random or the process is not explained or there are several major mistakes in arithmetic, notation, or vocabulary. 2. The Conditions E Cites randomness, <10% of all possible students, and checks normality with a plot. P Discusses normality but omits the plot or fails to mention one of the other conditions. I Misunderstands or omits the conditions, or lists irrelevant issues (np 10). 3. The Mechanics E Identifies the procedure, shows the sample statistics and degrees of freedom, writes the formula using correct critical value and notation, and calculates the correct interval (perhaps with minor arithmetic or rounding errors). P Appears to be doing the proper procedure but omits important information, uses the wrong critical value, uses the wrong notation, or makes major errors in calculations. I Uses the wrong procedure or shows no work or makes several major mistakes. 4. The Interpretation E Correctly interprets the confidence interval in the proper context, and compares the local performance to statewide results. P Writes a conclusion that is correct but not in context or doesnt compare local performance to statewide results. I Does not interpret the confidence interval correctly. Comments: Scoring Es count 1 point, Ps are 1/2 AP score = sum of 4 components; rounding based on quality of P responses Grade: A = 4, B = 3, etc., with +/- based on rounding (ex: 3.5 rounded to 3 is a B+) Name _______________________________ AP Score ____ Grade ____ 23-16 Copyright 2010 Pearson Education, Inc. NOTE: We present a model solution with some trepidation. This is not a scoring key, just an example. Many other approaches could fully satisfy the requirements outlined in the scoring rubric. That (not this) is the standard by which student responses should be evaluated. Model Solution - Investigative Task SAT Performance I want to determine the mean SAT-Math score at this high school. I will take a simple random sample of 25 students by assigning a number 001-300 to each of the students, and then choosing 25 random numbers 001-300, ignoring repeats. I want to find a 95% confidence interval for the mean SAT-Math score of all students at this high school. I have data on the scores of 25 students, from a simple random sample of the 300 students at this high school. Model. 10 Randomization Condition: The students were chosen by a simple random sample. 10% Condition: 25 students represent less than 10% of the population of 300 students. 8 6 4 2 Nearly Normal Condition: The distribution of SAT-Math scores in the sample is unimodal and symmetric. 375 600 Math The conditions are satisfied, so I will use a Students t-model with (n 1) = 25 1 = 24 degrees of freedom to find a one-sample t-interval for the mean. Mechanics. From my sample of 25 students: n = 25 scores y = 602.8 s = 74.3034 74.3034 * y t24 SE ( y ) = 602.8 2.064 = 602.8 30.672 = (572.13, 633.47) 25 Conclusion. I am 95% confident that the interval from 572.13 to 633.47 contains the true mean SAT-Math score for students at this high school. According to my confidence interval, students at this high school had a higher mean SAT-Math score than students nationwide. The national mean of 503 was not included in my 95% confidence interval. 23-17 Copyright 2010 Pearson Education, Inc. 23-18 Copyright 2010 Pearson Education, Inc. AP Statistics SAT Data Gender F F M M M M F M F F M F F M M M F M F M M M F F F M M F F M F F M F M F F F M M M M M M M F F F F F F F Verbal 450 640 590 400 600 610 630 660 660 590 580 500 480 530 690 480 450 650 620 630 490 560 710 360 590 520 510 630 740 680 660 690 540 670 580 600 600 710 640 490 650 580 710 740 510 650 580 600 670 640 370 620 Math 450 540 570 400 590 610 610 570 720 640 650 540 360 520 640 570 420 650 600 740 520 620 720 390 530 630 600 630 760 690 700 750 560 520 720 560 560 640 650 590 680 710 730 700 560 760 660 640 700 670 460 470 Gender M F M M F M F M M M M M M M F F M M M F M F M F M M M F M M M M F F M F F F F F M F M M M F F F M F F F Verbal 680 620 590 430 680 650 650 660 640 460 430 460 800 310 510 770 750 580 410 430 630 630 480 420 800 650 580 560 670 610 510 560 770 530 650 730 690 610 710 730 450 400 700 580 600 630 600 630 690 630 480 680 Math 650 620 690 540 740 650 680 700 680 680 480 600 720 530 550 700 670 630 390 400 680 630 550 430 650 500 690 540 760 740 610 530 690 430 740 790 640 510 680 590 570 410 620 660 670 690 640 700 740 560 500 630 Gender F M M M M F F M M F F M M F F F F M F F F F F F F M F M F M M M F M F M F F M M M M F F M F F F M F F M Verbal 650 460 560 610 620 590 390 510 450 520 470 690 510 770 520 550 680 740 660 740 730 560 570 560 670 650 690 610 500 560 640 430 700 620 610 580 730 520 540 640 680 580 640 700 600 540 480 710 650 640 370 710 Math 680 410 560 760 650 640 440 530 440 690 410 620 540 710 570 600 580 700 690 640 680 630 530 540 520 710 700 740 650 700 650 490 570 670 640 640 570 530 580 610 720 490 630 650 630 510 540 700 780 570 410 700 23-19 Copyright 2010 Pearson Education, Inc. Gender F F M M M F F M F M M F M F M F F M F M M M F M F M M M F F M F F M M M M F M M F M F M F M M F F M F F F F Verbal 630 590 750 600 610 490 680 520 680 650 600 550 490 530 560 630 510 710 550 690 700 540 280 710 640 600 610 680 520 730 510 620 530 550 590 620 640 490 530 560 710 480 490 460 630 510 600 550 670 560 600 650 630 440 Math 660 580 800 690 550 800 610 540 660 700 560 560 390 530 560 590 520 740 560 620 700 620 500 760 710 590 670 670 470 740 680 740 500 680 670 640 570 530 580 600 700 530 490 560 520 520 610 570 580 520 600 530 610 470 Gender M M M M M M F M F F F M M M M F M F M M M F F F M M M F F M F F M M F F F M F F F F F M M F M F M F M M M M Verbal 650 690 360 510 610 700 540 600 610 700 490 390 580 760 730 590 640 610 760 570 700 630 530 490 600 580 710 600 560 570 630 640 550 690 530 580 550 690 420 690 560 560 490 430 690 600 700 630 620 550 600 450 690 590 Math 560 670 290 510 570 740 490 670 710 570 560 630 530 760 760 620 740 620 700 630 740 630 670 700 660 760 700 450 590 690 610 570 600 670 490 640 560 740 410 720 570 560 510 570 670 550 630 710 610 550 670 610 680 600 23-20 Copyright 2010 Pearson Education, Inc. Gender M F M M M F M M F F F M M M F M F M M M M M F F F F F F M F F M M F M M M M M Verbal 760 650 540 580 670 650 660 500 610 580 570 690 570 600 540 640 680 730 610 500 580 620 570 690 610 630 540 760 470 340 480 690 620 710 440 400 650 600 700 Math 580 600 630 530 650 680 660 580 510 500 620 620 540 670 670 760 600 670 550 550 630 710 630 610 570 570 660 690 470 450 500 760 610 630 740 710 740 660 570
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Stanford - STAT - 125
24-1Chapter 24Comparing MeansWhats It About?We extend inference for means to comparing the means of two groups. We look at both confidenceintervals and hypothesis tests. We emphasize that these methods are appropriate only when thegroups are indepen
Stanford - STAT - 125
25-1Chapter 25Paired Samples and BlocksWhats It About?Chapter 25 examines data collected from subjects that have been paired. We want to test whetherthe means of two groups are equal, but we cant use the independent two-sample t-test. We discussthe
Stanford - STAT - 125
AP Statistics Test A Inference Part VIName __ 1. Which of the following is true about Students t-models?I. They are unimodal, symmetric, and bell-shaped.II. They have fatter tails than the Normal model.III. As the degrees of freedom increase, the t-m
Stanford - STAT - 125
26-1Part VII: Inference When Variables Are Related: Chapters 26, 27, 28* and 29*The final Part of the text looks forward as well as backward. We apply just about everything wevelearned. But we also look beyond the course to possible future studies.The
Stanford - STAT - 125
27-1Chapter 27Inferences for RegressionWhats It About?In Chapter 27, we return to the question of associations between quantitative variables, examiningthe regression model, with its underlying assumptions and conditions. We see that that t-modelsde
Stanford - STAT - 125
AP Statistics Test A Inference Part VIIName _1. Suppose you were asked to analyze each of the situations described below. (NOTE: DO NOTDO THESE PROBLEMS!) For each, indicate which inference procedure you would use (fromthe list), the test statistic (z
Stanford - STAT - 125
PS-1Postscript(or. Now What?)Whats It About?Congratulations youve finished the course! But now what? We offer some ideas and suggestionsabout three concerns: reviewing for the AP Exam; advice to students about the exam; what to do after the exam.
Stanford - STAT - 125
28-1Chapter 28Analysis of VarianceWhats It About?We looked at equality of means for two groups in Chapter 24. Here, in Chapter 28, we extend ourconcern to more than two groups. Another way of looking at this question is to think of it as theassociat
Stanford - STAT - 125
29-1Chapter 29Multiple RegressionWhats It About?We first looked at simple regression in Chapters 8, and 9. There we learned about interpretingregression coefficients and R 2 . Then we discussed inference for regression in Chapter 27 andexplained mos
Stanford - STAT - 125
AP* Correlations-1AP* CorrelationsStats: Modeling the World, 3rd Edition AP* Edition 2010Bock, Velleman, De VeauxCorrelated to: Advanced Placement* (AP*) Statistics Standards (Grades 912)I. Exploring Data: Describing patterns and departures from patt
Stanford - STAT - 125
Index-1Index of ApplicationsThis index categorizes examples in the text according to real-world applications.Exercises (E), Just Checking (JC), For Example (FE), Step-By-Step (SBS) and In-TextExamples (IE) are indexed.AgricultureAngus Steers (E) Ch.
Stony Brook University - CSE - 220
CSE 220 Systems-Level ProgrammingEugene W. StarkDepartment of Computer ScienceSUNY at Stony BrookStony Brook, NY 11794-4400 USA631-632-8444stark@cs.sunysb.eduhttp:/www.cs.sunysb.edu/starkBrief Course DescriptionIntroduces systems-level programmin
Stony Brook University - CSE - 220
Computer System OrganizationModern computer systems have a layered organization:Application Level Who? Application programmers What? Application programs Language? High-level language(Java, Python, Prolog, C+, C#, C, etc.) How executed? Typically t
Stony Brook University - CSE - 220
Binary DigitsAll data is ultimately represented in a computer in terms of bits(binary digits). Each bit is either a 0 or a 1. Groups of bits represent higher-level values.1101 1110 1010 1101 1011 1110 1110 1111Why Binary?Binary digits have simple p
Stony Brook University - CSE - 220
FractionsRadix-k fractions assign negative powers of k as the values ofthe places to the right of the radix point.10.0112 = (1 21) + (0 20) + (0 21) + (1 22) + (1 23)= 2 + 0.25 + 0.125= 2.37510Numbers with terminating decimal representations need no
Stony Brook University - CSE - 220
Excess-k EncodingExcess-k encoding is another signed integer encoding that isimportant due to its use in representing oating point exponents. In excess-k (or bias-k) encoding, integer i is represented bythe unsigned encoding of i + k. For example, in
Stony Brook University - CSE - 220
Byte OrderingMemory is byte-addressible, but the CPU manipulates multi-bytevalues, such as 4-byte integer or 8-byte double-precision oatingpoint values.Question: When a 4-byte integer is stored in memory, whichof the four bytes is stored in the lowes
Stony Brook University - CSE - 220
Introduction to C: History In 1970, Ken Thompson of Bell Labs designed the B language for the rst Unix system on the DEC PDP-7. Soon after, the C language was designed by Dennis Ritchieat Bell Labs for implementing Unix on the PDP-11. In 1978, Brian K
Stony Brook University - CSE - 220
C Compiler: What Actually Happened?A lot of stu is hidden behind that one cc command. To buildmore complex C programs, you need know actually happened.1. The C preprocessor processed macros (#define and #include)in the source code (.c le) and produced
Stony Brook University - CSE - 220
Switch StatementsThe switch statement performs a multi-way branch based on thevalue of an integer expression:switch(x % 100) cfw_case 0:case 1:printf(&quot;Case 0-1\n&quot;);break;default:printf(&quot;Case 2-99\n&quot;);break;Switch statements are the same (in no
Stony Brook University - CSE - 220
Call by ValueC (like Java) uses the call-by-value method for passing parameters to functions: An actual parameter expression is evaluated to obtain its Rvalue, which is the actual parameter. The actual parameter is transmitted to to the called function
Stony Brook University - CSE - 220
Array Declarations The declarationint a[10];declares a to be an array capable of storing ten values of typeint. The declarationchar c[100];declares c to be an array capable of storing one hundredvalues of type char. The declarationfloat *p[10];
Stony Brook University - CSE - 220
Multidimensional ArraysC (like Java) supports two- and higher-dimensional arrays.A two-dimensional array is just an array of arrays.int a[10][20]; The array a is stored in a contiguous region of memory, inrow major order. &amp;a[0][0] = a &amp;a[i][0] = a[
Stony Brook University - CSE - 220
StructuresStructures provide a way of combining a collection of relatedvariables together into a single compound variable.Programming with structures involves three activities: Dening structure types:structintcharchar;record cfw_next;surname[N
Stony Brook University - CSE - 220
Pointers to Structures: Arrow NotationJust as for other pointer variables, if p is a pointer to a structure,then *p denotes the structure pointed to by p.struct employee cfw_char name[NAMELEN];int age;float salary; emp1, emp2;struct employee *ep =
Stony Brook University - CSE - 220
Function Prototypes for malloc() and free()The function prototypes for malloc() and free() look a littlestrange:void *malloc(size_t size);void free(void *ptr);In ANSI C, the type void * is a generic pointer type. A pointer of any type can be assigne
Stony Brook University - CSE - 220
Von Neumann ArchitectureThe term von Neumann architecture refers to a particular designmodel for a stored-program digital computer. Separate central processing unit (CPU) and random-accessmemory (RAM). Both instructions and data stored in RAM. Data
Stony Brook University - CSE - 220
Increasing ComplexityFor a long time, computer designers built machines with everlarger and more complex instruction sets. Many dierent kinds of instructions, with complex functions. Many dierent addressing modes (ways to access memory). Each instruc
Stony Brook University - CSE - 220
MIPS Data Transfer InstructionsImmediate FormatLoad byte100000ssssstttttiiiiiiiiiiiiiiiilb Rt,Imm(Rs)Add Rs to sign-extended immediate value to obtain eectiveaddress. Read 8-bit byte from memory at eective address,sign-extend, and place result i
Stony Brook University - CSE - 220
Assembly LanguageEarly computers were programmed directly in binary machinelanguage.0000000100001000010000000010011000100101000010000000000000000001001001010000100000000000000000010010010100001000000000000000000100000001000010000100000000100110 Co
Stony Brook University - CSE - 220
System CallsWhen an application program needs to perform I/O, it does soby making a system call to the operating system. Real operating systems implement hundreds of dierent system calls. The MARS simulator just implements a few that providesimple se
Stony Brook University - CSE - 220
Related Assembler Directives.data.byte A, B, C.half 1, 2, 3.word 1, 2, 3.float 6.02E23.double 6.02E23#####initializeinitializeinitializeinitializeinitializebyte datahalf-word dataword datafloat datadouble dataExample: Saving $ra to M
Stony Brook University - CSE - 220
Summary: Subroutine Call Steps (Caller) Args to registers (a0-a3) or stack (Caller) Save return address (ra) and transfer control (Subr) Allocate space for AR, save sp and fp (Subr) Save callee-save registers (Subr) [do subroutine body] (Subr) Place
Stony Brook University - CSE - 220
Static Structure ReferencesStatic structure references use direct addressing.static struct cfw_int x;int y; s;.s.y = s.x+17;.=&gt;=&gt;s:.data.word 0, 0lwaddisw$t0, s+0$t0, 17$t0, s+4# 0 = offset of x# 4 = offset of yAutomatic Structure Re
Stony Brook University - CSE - 312
CSE/ISE 312:Legal, Social, andEthical Issues inInformation SystemsStony Brook UniversitySpring 2012Section 01Basic InformationCourse DescriptionThis course deals with the impact of computers on us asindividuals and on our society. Rapid changes
Stony Brook University - CSE - 312
IntroductionCSE/ISE 312: Legal, Social, and Ethical Issues in Information SystemsFall 20111Outline of TopicsNew Developments and Their ImpactsIssues and Themes to ConsiderEthics2New Developments and TheirImpacts3The Rapid Paceof ChangeAs tim
Stony Brook University - CSE - 312
Technology and PrivacyCSE/ISE 312SUNY at Stony BrookOutline of TopicsPrivacy and Computer TechnologyBig Brother is Watching YouDiverse Privacy TopicsProtecting PrivacyCommunicationsPrivacy and ComputerTechnologyTheories of PrivacyThe right to
Stony Brook University - CSE - 312
Freedom of SpeechCSE/ISE 312Legal, Social, and Ethical Issues in Information SystemsSUNY at Stony BrookOutline of Topics Changing Communications Paradigms Controlling Offensive Speech Censorship on the Global Network Political Campaign Regulations
Stony Brook University - CSE - 312
Intellectual PropertyCSE/ISE 312Legal, Social, and Ethical Issues in Information SystemsThe Congress shall have Power To.promote the Progress of Science anduseful Arts, by securing for limitedTimes to Authors and Inventors theexclusive Right to the
Stony Brook University - CSE - 312
Computer CrimeCSE/ISE 312 Legal, Social, and Ethical Issues in Information SystemsNew Temptations Computers make many things easier for us This includes illegal activities New environment for old activities: fraud,stock manipulation, forgery, espionag
Stony Brook University - CSE - 312
Computers and WorkCSE/ISE 312 Legal, Social, and Ethical Issues in Information SystemsFears and Questions Computers easily handle boring, repetitivetasks and provide quick, reliable information Fear: computers will replace human workers common fear f
Stony Brook University - CSE - 312
Evaluating and Controlling TechnologyCSE/ISE 312 Legal, Social, and Ethical Issues in Information SystemsEvaluating Info on the Web Much of what we find on the Net is wrong Search engines rank by popularity, not accuracy &quot;Democratic journalism&quot; Wikiped
Stony Brook University - CSE - 312
Errors, Failures, and RiskCSE/ISE 312 Legal, Social, and Ethical Issues in Information SystemsOutline of Topics Failures and Errors in Computer Systems Case Study: The Therac-25 Increasing Reliability and Safety Dependence, Risk, and ProgressFailures
Stony Brook University - CSE - 312
Professional EthicsCSE/ISE 312: Legal, Social, and Ethical Issues in Information SystemsProfessional Ethics Ethical issues a person might encounter onthe job as a computing professional Relationships with and responsibilities toyou and otherscustom
Stony Brook University - CSE - 312
Electronic VotingCSE/ISE 312 Legal, Social, and Ethical Issues in Information SystemsSome information courtesy of Dr. Amanda Stent&quot;Those who cast the votes decide nothing. Those who count the votes decide everything.&quot; - Joseph Stalin (attributed)Votin
Stony Brook University - CSE - 312
CSE/ISE 312: Legal, Social, and Ethical Issues in Information Systems (Section 01)SUNY at Stony Brook, Spring 2012Course DescriptionThis course deals with the impact of computers on us as individuals and on our society. Rapid changes in computing techn
Stony Brook University - CSE - 310
Chapter 1 IntroductionA note on the use of these ppt slides:We're making these slides freely available to all (faculty, students, readers). They're in PowerPoint form so you can add, modify, and delete slides (including this one) and slide content to su
Stony Brook University - CSE - 310
Chapter 2 Application LayerA note on the use of these ppt slides:We're making these slides freely available to all (faculty, students, readers). They're in PowerPoint form so you can add, modify, and delete slides (including this one) and slide content
Stony Brook University - CSE - 310
Chapter 3 Transport LayerA note on the use of these ppt slides:We're making these slides freely available to all (faculty, students, readers). They're in PowerPoint form so you can add, modify, and delete slides (including this one) and slide content to
Stony Brook University - CSE - 310
Chapter 4 Network LayerA note on the use of these ppt slides:We're making these slides freely available to all (faculty, students, readers). They're in PowerPoint form so you can add, modify, and delete slides (including this one) and slide content to s
Stony Brook University - CSE - 310
Chapter 5 Link Layer and LANsA note on the use of these ppt slides:We're making these slides freely available to all (faculty, students, readers). They're in PowerPoint form so you can add, modify, and delete slides (including this one) and slide conten
DeVry Chicago - SPEECH - 275
Shanee BrownProfessor Teresa HayesJanuary 12, 2012Speech 275Listening Activity:Pg61:Exercises for Critical Thinking#2In at least words, using the Listening Self-evaluation form on page 54, undertake a candidevaluation of your major stenghts and wea
Munich Business - BUSINESS - MG301
Chi Square MethodAre u interested in politicsMaleYesNoTotalFemale363470Total921304555100fof1f2f3f4fe3693421fo-fe31.513.538.516.54.5-4.5-4.54.5(fo-fe)2Fo-Fe)2/fe20.250.64320.251.520.250.52620.251.2273.896= 5%
Munich Business - BUSINESS - MG301
How Do Dividends Affect Stock Price?Dividend expectations determine the price an investor pays for a stock.Because dividends are paid quarterly, companies issue dividends based on their net earnings forthe quarter. Although dividends are not guaranteed
Munich Business - BUSINESS - MG301
Terey hotey janam liya hotaKoi mujh sa na doosra hotaSaans letta Tu aur mein jee uth'taKaash Makkah ki mein fiza hotaHijraton mein paraoo hota meinAur Tu khuch dair ko rukaa hotaTerey hujray key Aass pass kaheenMein koi kacha rastaa hotaBeech Taai
Munich Business - BUSINESS - MG301
Political System of PakistanSubmitted By:Ahmed Hassan08-0108ContentsDedicationI want to dedicate this project to all my parents, teachers who grow up at this level. I want todedicate this project to my teacher Mr. Khalilullah Awan who really worked
Munich Business - BUSINESS - MG301
Topic onProduction and CostFunctions and TheirEstimationProduction functionA table, graph, or equation showing the maximumoutput rate of the product that can be achievedfrom any specified set of usage rates of inputsProduction functionThomas Mach
Munich Business - BUSINESS - MG301
CHAPTER3ConsumerBehaviorPrepared by:F e rna ndo &amp; Yvonn Quija noCopyright 2009 Pearson Education, Inc. Publishing as Prentice Hall Microeconomics Pindyck/Rubinfeld, 8e.CHAPTER 3 OUTLINE3.1 Consumer Preferences3.2 Budget ConstraintsChapter 3: Con
Munich Business - BUSINESS - MG301
RET P AHC3Demand, Supply, andDemand,Market EquilibriumMarketP repared by: Fernando QuijanoPreparedand Yvonn Quijanoand 2002 Prentice Hall Business Publishing2002Principles of Economics, 6/eKarl Case, Ray FairThe Basic Decision-Making Units
Munich Business - BUSINESS - MG301
The Consumer TheoryHow Consumers Make Choices underIncome ConstraintsSome Questions What is behind a consumers demandcurve? How do consumers choose from amongvarious consumer goods? What determines the value of a consumergood?Utility The value
Munich Business - BUSINESS - MG301
MarketsandCompetitionMarketsandCompetitionAmarketisagroupofbuyersandsellersofaparticularproduct.Acompetitivemarketisonewithmanybuyersandsellers,eachhasanegligibleeffectonprice.Inmoderneconomics,Amarketisagroupofbuyersandsellersofaparticularproduct