100%(1)1 out of 1 people found this document helpful
This preview shows page 1 - 4 out of 9 pages.
Problem 1 (10 points):
This problem is an example of data preprocessing needed in a data mining process. Suppose that a hospital tested the age and body fat data for 18 randomly selected adults with the followingresults:Age262629294045505560%fat10.530.58.820.832.426.930.430.233.2Age554560556162637566%fat36.644.530.835.418.104.22.1683.237.7a)(2 points) Draw the box-plots for age and %fat. Interpret the distribution of the data.
c)(2 points) Regardless of the original ranges of the variables, normalization techniques transform the data intonew ranges that allow to compare and use variables on the same scales. What are the values ranges of thefollowing normalization methods? Explain your answer.