# STAT1303 Data Management: Tutorial 10

The University of Hong Kong, Department of Statistics and Actuarial Sciences

STAT1303 DATA MANAGEMENT, Tutorial 10

1. The file app.xls contains the 2009 applicants' information for the PhD program in the Department of Statistics at XX University. Applications are always processed in two steps: first, the graduate school drops some applicants and sends the rest to the department; then the department decides who should be enrolled. Answer the following questions based on the Excel file:

   (a) Use PROC IMPORT to create a data set named APP, including the variables AID, TOEFL, GRE and GPA;

   (b) GPA is normally reported on a 4-point scale, but some applicants use a 5-point scale. Convert any GPA larger than 4 to the 4-point scale;

   (c) Print out the 'silly applicant' whose sex is neither 'M' nor 'F';

   (d) Determine the drop list for the graduate school under the following conditions: TOEFL < 90, GRE < 2000, GPA < 3.0, and print it out;

   (e) Use PROC UNIVARIATE to get a summary of all the numerical variables for all the applicants. Use the summary statistics and QQPLOT to state whether the three variables (GPA, TOEFL, GRE) are normally distributed.

2. Consider a data set q7:

   (a) Replace the missing values of X1 and X2 by the means;

   (b) Replace the missing values of X1 and X2 by the means for each level of GROUP, i.e. use the group mean instead of the overall mean;

   (c) It is known that the value of X5 depends on the values of X3 and X4, where log(X5) = X3 + 0.1 × X4. Replace the missing values of X5 using this formula. When X3 or X4 is missing, use the mean of X3 or X4 instead;

   (d) It is known that the value of X6 depends on X7, where X6 = u(i) + exp(X7 - a(i)), u(i) is the mean of X6 at the i-th level of GROUP, and a(i) is the mean of X7 at the i-th level of GROUP. Replace the missing values of X6 using this formula....
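The row-level rules in Question 1, parts (b) to (d), can be sketched outside SAS to check the logic before writing the DATA step. The following is a minimal Python illustration, not the tutorial's intended SAS solution. The records, the `SEX` column name and the helper names are hypothetical, and it assumes that an applicant is dropped when any one of the three thresholds is failed, which the question does not state explicitly.

```python
# Hypothetical applicant records; the real data are in app.xls.
applicants = [
    {"AID": 1, "SEX": "M", "TOEFL": 100, "GRE": 2100, "GPA": 3.6},
    {"AID": 2, "SEX": "F", "TOEFL": 85, "GRE": 2200, "GPA": 4.5},
    {"AID": 3, "SEX": "X", "TOEFL": 95, "GRE": 1900, "GPA": 3.2},
    {"AID": 4, "SEX": "F", "TOEFL": 105, "GRE": 2300, "GPA": 3.5},
]


def rescale_gpa(gpa):
    """(b) Treat a GPA above 4 as 5-point and map it onto the 4-point scale."""
    return gpa * 4 / 5 if gpa > 4 else gpa


def is_silly(sex):
    """(c) True when sex is neither 'M' nor 'F' (a blank value also counts)."""
    return sex not in ("M", "F")


def is_dropped(row):
    """(d) Assumption: failing ANY one threshold puts the applicant on the list."""
    return row["TOEFL"] < 90 or row["GRE"] < 2000 or row["GPA"] < 3.0


# Rescale GPA first so that the GPA < 3.0 check uses the 4-point scale.
cleaned = [{**row, "GPA": rescale_gpa(row["GPA"])} for row in applicants]
silly_ids = [row["AID"] for row in cleaned if is_silly(row["SEX"])]
drop_ids = [row["AID"] for row in cleaned if is_dropped(row)]

print("silly applicants:", silly_ids)
print("drop list:", drop_ids)
```

The order matters: if the drop rule were applied before the rescaling in (b), a 5-point GPA could pass the 3.0 threshold when its 4-point equivalent would not.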
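The four imputation rules in Question 2 can likewise be sketched in Python to make the arithmetic concrete. This is an illustration under stated assumptions, not the SAS solution the tutorial asks for. The rows are hypothetical (q7 itself is not shown), X2 is omitted because it is handled exactly like X1, "log" is taken to be the natural logarithm, and a row with a missing X7 is left unfilled in part (d) because the question does not say how to handle it.

```python
import math
from statistics import mean

# Hypothetical rows standing in for q7; None marks a missing value.
rows = [
    {"GROUP": "A", "X1": 2.0, "X3": 1.0, "X4": 10.0, "X5": None, "X6": 5.0, "X7": 1.0},
    {"GROUP": "A", "X1": None, "X3": None, "X4": 20.0, "X5": None, "X6": None, "X7": 3.0},
    {"GROUP": "B", "X1": 6.0, "X3": 3.0, "X4": 30.0, "X5": 1.0, "X6": 7.0, "X7": 2.0},
    {"GROUP": "B", "X1": 10.0, "X3": 2.0, "X4": 0.0, "X5": 2.0, "X6": 9.0, "X7": 4.0},
]


def overall_mean(rows, col):
    """Mean of the non-missing values of one column."""
    return mean(r[col] for r in rows if r[col] is not None)


def group_means(rows, col):
    """Mean of the non-missing values of one column, per level of GROUP."""
    groups = {}
    for r in rows:
        if r[col] is not None:
            groups.setdefault(r["GROUP"], []).append(r[col])
    return {g: mean(values) for g, values in groups.items()}


def fill_overall(rows, col):
    """(a) Replace missing values by the overall mean."""
    m = overall_mean(rows, col)
    return [{**r, col: m if r[col] is None else r[col]} for r in rows]


def fill_by_group(rows, col):
    """(b) Replace missing values by the mean of the row's own GROUP."""
    gm = group_means(rows, col)
    return [{**r, col: gm[r["GROUP"]] if r[col] is None else r[col]} for r in rows]


def fill_x5(rows):
    """(c) X5 = exp(X3 + 0.1 * X4), with missing X3 or X4 replaced by its mean."""
    m3, m4 = overall_mean(rows, "X3"), overall_mean(rows, "X4")
    out = []
    for r in rows:
        if r["X5"] is None:
            x3 = m3 if r["X3"] is None else r["X3"]
            x4 = m4 if r["X4"] is None else r["X4"]
            r = {**r, "X5": math.exp(x3 + 0.1 * x4)}
        out.append(r)
    return out


def fill_x6(rows):
    """(d) X6 = u(i) + exp(X7 - a(i)), using the GROUP means of X6 and X7."""
    u, a = group_means(rows, "X6"), group_means(rows, "X7")
    out = []
    for r in rows:
        # Assumption: rows with a missing X7 are left as they are.
        if r["X6"] is None and r["X7"] is not None:
            g = r["GROUP"]
            r = {**r, "X6": u[g] + math.exp(r["X7"] - a[g])}
        out.append(r)
    return out


print(fill_overall(rows, "X1")[1]["X1"])
print(fill_by_group(rows, "X1")[1]["X1"])
print(fill_x5(rows)[1]["X5"])
print(fill_x6(rows)[1]["X6"])
```

Each function returns new rows and leaves the input untouched, so the overall-mean and group-mean versions can be compared side by side on the same data.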
