
Data mining steps


1. Set the business objectives: This can be the hardest part of the data mining process, and many organizations spend too little time on this important step. Data scientists and business stakeholders need to work together to define the business problem, which helps inform the data questions and parameters for a given project. Analysts may also need to do additional research to understand the business context appropriately.

2. Data preparation: Once the scope of the problem is defined, it is easier for data scientists to identify which set of data will help answer the pertinent questions to the business. Once they collect the relevant data, the data will be cleaned, removing any noise, such as duplicates, missing values, and outliers. Depending on the dataset, an additional step may be taken to reduce the number of dimensions as too many features can slow down any subsequent computation. Data scientists will look to retain the most important predictors to ensure optimal
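
The cleaning and dimension-reduction steps described under data preparation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sample rows, the column meanings (age, income, a constant flag), the 1.5 x IQR outlier rule, and the helper names are all hypothetical and not taken from the original document. Dropping zero-variance columns stands in for dimensionality reduction here; it is a much simpler technique than methods such as PCA or model-based feature selection.

```python
from statistics import pvariance, quantiles


def drop_duplicates(rows):
    """Keep the first occurrence of each row, preserving order."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def drop_missing(rows):
    """Discard rows that contain a missing value (represented as None)."""
    return [row for row in rows if all(value is not None for value in row)]


def drop_outliers(rows, column, k=1.5):
    """Discard rows whose value in `column` falls outside the IQR fences.

    The k=1.5 multiplier is a common convention, assumed here for illustration.
    """
    q1, _, q3 = quantiles([row[column] for row in rows], n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [row for row in rows if low <= row[column] <= high]


def drop_constant_columns(rows):
    """Remove columns with zero variance, since they carry no information."""
    n_cols = len(rows[0])
    kept = [c for c in range(n_cols) if pvariance([row[c] for row in rows]) > 0]
    return [[row[c] for c in kept] for row in rows], kept


# Hypothetical sample data: [age, income, constant_flag]
raw = [
    [25, 40000, 1],
    [25, 40000, 1],    # exact duplicate of the first row
    [31, None, 1],     # missing income
    [38, 52000, 1],
    [42, 58000, 1],
    [29, 45000, 1],
    [35, 900000, 1],   # implausibly large income
    [33, 49000, 1],
]

rows = drop_duplicates(raw)
rows = drop_missing(rows)
rows = drop_outliers(rows, column=1)
cleaned, kept_columns = drop_constant_columns(rows)

print(len(raw), "raw rows ->", len(cleaned), "cleaned rows")
print("columns kept:", kept_columns)
```

The order of the steps matters in this sketch: duplicates and missing values are removed first so that they do not distort the quartiles used for the outlier check, and the variance test runs last so that it reflects only the rows that survive cleaning.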
Term
Fall
Professor
N/A
Tags
Data Mining, Machine Learning, hardest part of the data mining process
