Problem 4 (25%): Answer the following questions about the data cleaning and integration process:
a. In real-world data, there are often rows that have missing values for some variables. Describe two methods for dealing with this problem.
b. If we have class labels for our data, how can we use them to obtain better estimates when filling in missing values?
c. Describe two issues that may come up during data integration.