Math 127B. Practice Problem
April 25, 2011

This is a practice problem for R. The dataset for this practice exam is taken from US News and World Report, 1995, and concerns colleges in the US that year. The dataset for this problem is called college.csv. It is a comma-separated file and has a header row. Here is what each column indicates.

1. The name of the university
2. The two-letter state
3. Whether the school is public or private. Public is indicated by 1, private by 2.
4. The average math SAT score
5. The average verbal SAT score
6. Number of applications received
7. Number of applicants accepted
8. Undergraduate enrollment
9. In-state tuition
10. Out-of-state tuition (for private schools the in-state and out-of-state tuition are the same)

The dataset does contain missing values, indicated by NA. For example, not all schools reported SAT scores.

Answer the following questions.

1. How many schools are in the sample?
2. Draw a histogram for each of the following:

   (b) The out-of-state tuition for private schools
   (c) The out-of-state tuition for public schools
   Don't forget the labels and the title.
3. Are the in-state and out-of-state tuition for public schools related? Draw a scatter plot to answer this question.
4. Compute the mean in-state and out-of-state tuition for public colleges. [You will need to remove the missing values using the na.rm option to mean.]
5. Draw a scatter plot of the tuition at private colleges against the average SAT score. Do these seem related?
6. Are math and verbal SAT scores related?
7. Draw a histogram of the out-of-state tuition for all colleges.
8. In 1995, what were the 10 most expensive colleges in the survey?
9. The acceptance rate of a college is defined to be the percentage of applications that were accepted. Compute the acceptance rate and add it to the dataset.
10. Find the 10 most selective schools.
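The questions above rely on a handful of base R calls. The following is a minimal sketch rather than a worked solution. It builds a tiny made-up data frame in place of college.csv, and the column names (name, type, apps, accepted, in_state, out_state) are assumptions, since the actual header names are not listed here. Substitute the names you find in the real file.

```r
# Toy stand-in for college.csv. Rows and column names are hypothetical.
toy <- data.frame(
  name      = c("A College", "B University", "C State", "D Tech"),
  state     = c("CA", "NY", "TX", "MA"),
  type      = c(1, 2, 1, 2),               # 1 = public, 2 = private
  apps      = c(1000, 2000, 4000, 500),
  accepted  = c(800, 500, 3000, 100),
  in_state  = c(3000, 18000, NA, 20000),   # NA marks a missing value
  out_state = c(9000, 18000, 7000, 20000)
)
path <- tempfile(fileext = ".csv")
write.csv(toy, path, row.names = FALSE)

# Read a comma-separated file with a header row. "NA" entries become missing values.
college <- read.csv(path, header = TRUE, stringsAsFactors = FALSE)

# Number of rows, i.e. number of schools.
n_schools <- nrow(college)

# Subset by school type, then average while dropping missing values.
public <- subset(college, type == 1)
mean_in_public <- mean(public$in_state, na.rm = TRUE)

# Add a derived column: percentage of applications that were accepted.
college$accept_rate <- 100 * college$accepted / college$apps

# Sort by a column and keep the first few names (2 here, 10 in the exercise).
most_selective <- head(college[order(college$accept_rate), "name"], 2)

# Plots with a title and axis labels. pdf(NULL) only keeps this sketch from
# writing a file when run as a script; omit it in an interactive session.
pdf(NULL)
hist(public$out_state,
     main = "Out-of-state tuition, public schools (toy data)",
     xlab = "Tuition (dollars)")
plot(public$in_state, public$out_state,
     main = "In-state vs. out-of-state tuition (toy data)",
     xlab = "In-state tuition", ylab = "Out-of-state tuition")
dev.off()
```

With the real data, replace the toy block with read.csv("college.csv", header = TRUE) and adjust the column names to match the header row.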
