STP429: Assignment #7: Multicollinearity, Diagnostics, and Subset Selection
Dr. Jennifer Broatch

Use the Job Proficiency data found in jobpro.txt to answer the following questions. The data are from Applied Linear Statistical Models, 4th ed. (1996), by Neter, Kutner, Nachtsheim, and Wasserman.

A personnel officer in a governmental agency administered four newly developed aptitude tests to each of 25 applicants for entry-level clerical positions. For the purpose of the study, all 25 applicants were accepted for positions irrespective of their test scores. After a probationary period, each applicant was rated for proficiency on the job. The scores for the four tests (X1, X2, X3, X4) and the job proficiency scores are recorded in the file jobpro.txt.

Use the following data step to read in the data.

    data jobpro;
      title 'Job Proficiency';
      * READ IN THE DATA;
      infile 'M:. ....JOBPRO.txt';
      input y X1 X2 X3 x4;
    run;

1. Prepare scatterplots and a correlation matrix. Also obtain multicollinearity (MC) diagnostics and comment.
2. Fit a multiple regression model containing all 4 variables as first-order terms. Does it appear that
