Math 127B, Notes for lecture 10
Mrinal Raghupathi
Monday, February 16th, 2011

Administrivia

The topic to be discussed today is the sample average. We will mostly draw pictures of samples from a box.

Reminders

1. Bring questions for Monday’s review. I will be creating a Google Moderator page for you to post questions and vote. There will be no online quizzes on Monday 2/21 or Wednesday 2/23.
2. Homework 6 is due Thursday, 2/24 in recitation.
3. There will be a quiz in recitation over chapter 23 on 2/24.

1 Simulation and Pictures

In this section I am going to describe an experiment I did on my computer that illustrates the ideas we’ve learned so far. We are going to illustrate three things.

1. The histogram for the data and its relation to the histogram for the sample.
2. The histogram for the sums and their relation to the histogram for the averages and the normal curve.
3. The interpretation of the confidence plot.

1.1 The data

The data comes from the US News and World Report 1995. The data is about college tuition. After some fiddling and fussing I extracted the names and out-of-state tuition figures for the colleges in this dataset. Here is what a typical line of data looks like:

Remember that the figures we are about to deal with are from 1995. Tuition at Vanderbilt was \$17,865 in 1995, the 62nd highest in the dataset.

Here are some basic facts about the data. There are 1,281 colleges in the dataset; the most expensive is Middlebury College (\$25,750) and the cheapest is Grambling State University (\$1,044). This gives us an idea as to the number of bins we should create for our data. We picked ranges in steps of \$1,000, beginning at \$1,000 and going up to \$26,000. There are 25 ranges and here are the counts (frequencies):

5, 24, 50, 106, 130, 132, 122, 123, 113, 95, 87, 64, 57, 33, 29, 31, 22, 29, 23, 2, 1, 0, 0, 1, 2.

We draw a histogram of this data....
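The binning described above can be checked with a short script. This is only a sketch: the notes do not say what software was used for the experiment, and the names `counts`, `left_edges`, `percent_per_bin`, and `text_histogram` are made up for illustration. The counts are copied from the notes; the scale of one mark per 4 colleges is an arbitrary choice.

```python
# Counts per range, copied from the notes: 25 ranges of width 1,000,
# starting at 1,000 and ending at 26,000.
counts = [5, 24, 50, 106, 130, 132, 122, 123, 113, 95, 87, 64, 57,
          33, 29, 31, 22, 29, 23, 2, 1, 0, 0, 1, 2]

bin_width = 1000
# Left endpoints of the ranges: 1000, 2000, ..., 25000.
left_edges = list(range(1000, 26000, bin_width))

total = sum(counts)


def percent_per_bin(counts):
    """Return the percentage of all colleges that falls in each range."""
    n = sum(counts)
    return [100 * c / n for c in counts]


def text_histogram(left_edges, counts, width, colleges_per_mark=4):
    """Return one text line per range.

    Each '#' stands for roughly `colleges_per_mark` colleges
    (an arbitrary scale chosen for this sketch).
    """
    lines = []
    for left, c in zip(left_edges, counts):
        bar = "#" * round(c / colleges_per_mark)
        lines.append(f"{left:>6,} - {left + width:>6,} | {bar} {c}")
    return lines


if __name__ == "__main__":
    print(f"{len(counts)} ranges, {total} colleges")
    print("\n".join(text_histogram(left_edges, counts, bin_width)))
```

The 25 counts add up to 1,281, which agrees with the number of colleges in the dataset, so no college falls outside the chosen ranges. Since all the ranges have the same width, plotting counts or percentages gives a histogram of the same shape.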
