Problem 4 (25%): Answer the following questions about the data cleaning and integration process:
a. In real-world data, there are often rows that have missing values for some variables. Describe two methods for dealing with this problem.
b. If we have class labels for our data, how can we use them to obtain better estimates when filling in missing values?
c. Describe two issues that may come up during data integration.
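
As an illustration of the idea behind part (b), and not a model answer, the sketch below fills each missing value with the mean of the observed values that share the row's class label, falling back to the overall mean when a class has no observed values. The function name `impute_by_class`, the dictionary keys `label` and `value`, and the sample rows are all hypothetical and chosen only for this example.

```python
from collections import defaultdict
from statistics import mean


def impute_by_class(rows, value_key="value", label_key="label"):
    """Return copies of rows with None values replaced by the class mean."""
    # Collect the observed (non-missing) values for each class label.
    by_class = defaultdict(list)
    for row in rows:
        if row[value_key] is not None:
            by_class[row[label_key]].append(row[value_key])

    # Overall mean, used only when a class has no observed values at all.
    observed = [v for vals in by_class.values() for v in vals]
    overall = mean(observed) if observed else None

    filled = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are left unchanged
        if new_row[value_key] is None:
            vals = by_class.get(new_row[label_key])
            new_row[value_key] = mean(vals) if vals else overall
        filled.append(new_row)
    return filled


# Hypothetical sample data: two classes with one missing value each.
rows = [
    {"label": "A", "value": 1.0},
    {"label": "A", "value": 3.0},
    {"label": "A", "value": None},
    {"label": "B", "value": 10.0},
    {"label": "B", "value": None},
]
filled = impute_by_class(rows)
```

Here the missing value in class A is filled from the other class A rows only, so it is not pulled toward the much larger class B value, which is what a single global mean would do.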