STA 6166 – Fall 2010
Project 1
Due September 20, 2010

Part 1: Death on the Titanic

The data set titanic.dat (titanic.xls) consists of survival status (dead or alive) on the Titanic by age, gender and cabin class, including crew. In this exercise, we actually have the entire population, that is, we have data on every passenger on the Titanic. This provides us with an opportunity to compare sampling estimates with the true value.

Using the random number table or a statistical analysis package, draw random samples of size 10, 20, and 50 from the full data set. Calculate the proportion that survived and construct a confidence interval around it for each sample size.

Calculate the proportion that survived for the entire population. Report the difference between the estimates for each sample size and the true value.

Calculate the proportion that survived by cabin class. Does survival status appear to be independent of cabin class?

Create a new binary gender-age category with outcomes ‘women and children’ and ‘adult male’. Calculate the proportion that survived by ‘women and children’ and
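
The sampling tasks above (draw samples of size 10, 20, and 50, estimate the survival proportion with a confidence interval, and compare with the population value) could be sketched in Python as follows. This is only an illustration under stated assumptions: the layout of titanic.dat is not described here, so the code uses a made-up 0/1 population in its place, and the helper names `wald_interval` and `sample_and_compare` are hypothetical. The interval shown is the large-sample (Wald) interval at roughly 95% confidence; the assignment does not say which interval or level is expected, and for n = 10 the normal approximation is rough.

```python
import math
import random

Z_95 = 1.96  # approximate 97.5th percentile of the standard normal


def wald_interval(successes, n, z=Z_95):
    """Large-sample (Wald) interval for a proportion, clipped to [0, 1]."""
    p_hat = successes / n
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat, max(0.0, p_hat - half_width), min(1.0, p_hat + half_width)


def sample_and_compare(population, sizes, seed=0):
    """Draw one simple random sample (without replacement) per size.

    `population` is a list of 0/1 survival indicators. Returns the true
    proportion and, for each sample size, the estimate, its interval,
    and the estimate minus the true value.
    """
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    true_p = sum(population) / len(population)
    results = []
    for n in sizes:
        sample = rng.sample(population, n)
        p_hat, lower, upper = wald_interval(sum(sample), n)
        results.append({
            "n": n,
            "p_hat": p_hat,
            "lower": lower,
            "upper": upper,
            "error": p_hat - true_p,
        })
    return true_p, results


# ASSUMPTION: made-up stand-in for the real data (1 = survived, 0 = died).
# Replace this list with the survival column read from titanic.dat.
population = [1] * 300 + [0] * 700

true_p, results = sample_and_compare(population, sizes=(10, 20, 50))
print(f"population proportion: {true_p:.3f}")
for r in results:
    print(f"n={r['n']:>2}  p_hat={r['p_hat']:.3f}  "
          f"CI=({r['lower']:.3f}, {r['upper']:.3f})  error={r['error']:+.3f}")
```

Changing `seed` gives a different draw, which makes it easy to see how much the estimates vary from sample to sample at each size.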

## This note was uploaded on 06/04/2011 for the course STA 6166 taught by Professor Staff during the Fall '08 term at University of Florida.

