Foundations for statistical inference - Sampling distributions

In this lab, we investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution.

The data

We consider real estate data from the city of Ames, Iowa. The details of every real estate transaction in Ames are recorded by the City Assessor’s office. Our particular focus for this lab will be all residential home sales in Ames between 2006 and 2010. This collection represents our population of interest. In this lab we would like to learn about these home sales by taking smaller samples from the full population. Let’s load the data.

    download.file("", destfile = "ames.RData")
    load("ames.RData")

We see that there are quite a few variables in the data set, enough to do a very in-depth analysis. For this lab, we’ll restrict our attention to just two of the variables: the above ground living area of the house in square feet (Gr.Liv.Area) and the sale price (SalePrice). To save some effort throughout the lab, create two variables with short names that represent these two variables.

    area <- ames$Gr.Liv.Area
    price <- ames$SalePrice

Let’s look at the distribution of area in our population of home sales by calculating a few summary statistics and making a histogram.

    summary(area)
    hist(area)

1. Describe this population distribution.
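To make the idea of a sampling distribution from the start of the lab concrete, here is a minimal sketch in R. It does not use the Ames data: `population` is a simulated, right-skewed stand-in, and the names `n_samples`, `sample_size`, and `sample_means`, along with all the numeric settings, are assumptions chosen for illustration. Once `ames.RData` is loaded, `area` could take the place of `population`.

```r
set.seed(42)  # fix the random seed so the sketch is reproducible

# Hypothetical stand-in population (NOT the Ames data): 3000 right-skewed values
population <- rlnorm(3000, meanlog = 7.2, sdlog = 0.3)

n_samples   <- 1000  # how many repeated samples to draw (assumed)
sample_size <- 50    # observations in each sample (assumed)

# Draw repeated random samples without replacement and record each sample mean.
# The collection of these means approximates the sampling distribution of the mean.
sample_means <- replicate(n_samples, mean(sample(population, sample_size)))

mean(population)    # the population parameter being estimated
mean(sample_means)  # center of the approximate sampling distribution
sd(sample_means)    # spread of the point estimate from sample to sample
hist(sample_means)  # shape of the approximate sampling distribution
```

Each sample mean is one point estimate of the population mean. Comparing the histogram of `sample_means` with a histogram of `population` shows how much less the estimate varies than individual observations do.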


