{[ promptMessage ]}

Bookmark it

{[ promptMessage ]}

lect12

# lect12 - Lecture outline Classification Decision-tree...

This preview shows pages 1–13. Sign up to view the full content.

Lecture outline Classification Decision-tree classification

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
What is classification?
What is classification? Classification is the task of learning a target function f that maps attribute set x to one of the predefined class labels y

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
What is classification?
Why classification? The target function f is known as a classification model Descriptive modeling: Explanatory tool to distinguish between objects of different classes (e.g., description of who can pay back his loan) Predictive modeling: Predict a class of a previously unseen record

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
credit approval target marketing medical diagnosis treatment effectiveness analysis Typical applications
General approach to classification Training set consists of records with known class labels Training set is used to build a classification model The classification model is applied to the test set that consists of records with unknown labels

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
General approach to classification
Evaluation of classification models Counts of test records that are correctly (or incorrectly) predicted by the classification model Confusion matrix Class = 1 Class = 0 Class = 1 f 11 f 10 Class = 0 f 01 f 00 Predicted Class Actual Class 00 01 10 11 00 11 s prediction of # total s prediction correct # Accuracy f f f f f f + + + + = = 00 01 10 11 01 10 s prediction of # total s prediction wrong # rate Error f f f f f f + + + + = =

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Supervised vs. Unsupervised Learning Supervised learning (classification) Supervision: The training data (observations, measurements, etc.) are accompanied by labels indicating the class of the observations New data is classified based on the training set Unsupervised learning (clustering) The class labels of training data is unknown Given a set of measurements, observations, etc. with the aim of establishing the existence of classes or clusters in the data
Decision Trees Decision tree A flow-chart-like tree structure Internal node denotes a test on an attribute Branch represents an outcome of the test Leaf nodes represent class labels or class distribution Decision tree generation consists of two phases Tree construction At start, all the training examples are at the root Partition examples recursively based on selected attributes Tree pruning Identify and remove branches that reflect noise or outliers Use of decision tree: Classifying an unknown sample Test the attribute values of the sample against the decision tree

This preview has intentionally blurred sections. Sign up to view the full version.

View Full Document
Training Dataset age income student credit_rating buys_computer <=30 high no fair no <=30 high no excellent 31…40 high
This is the end of the preview. Sign up to access the rest of the document.

{[ snackBarMessage ]}

### Page1 / 46

lect12 - Lecture outline Classification Decision-tree...

This preview shows document pages 1 - 13. Sign up to view the full document.

View Full Document
Ask a homework question - tutors are online