Data Mining Assignment #2
CSC592 – Fall '05

Problem Statement

Given the following dataset with independent variables A1, A2 and dependent variable Class:

A1      A2      Class
TRUE    TRUE    yes
TRUE    TRUE    yes
TRUE    FALSE   no
FALSE   FALSE   yes
FALSE   TRUE    no
FALSE   TRUE    no

Compute the following items:

1. Compute the entropy of the whole data set. (5pts)
2. Compute the attribute which should be used as the root node of a decision tree describing this dataset using the gain formula introduced in class. (15pts)

Show all your work.

Handing in your assignment

Please hand in your typewritten assignment during class. The due date is Monday, September 26th in class.
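
For checking hand calculations, the two quantities named in the problem statement can be sketched in Python. This is a minimal sketch, not the course's reference method: the handout does not reproduce the gain formula introduced in class, so the code assumes the standard information gain (entropy of the whole set minus the size-weighted entropy of each subset) with base-2 logarithms. The names `ROWS`, `entropy`, and `gain` are illustrative choices, not part of the assignment.

```python
from collections import Counter
from math import log2

# Rows copied from the problem statement: (A1, A2, Class).
ROWS = [
    (True, True, "yes"),
    (True, True, "yes"),
    (True, False, "no"),
    (False, False, "yes"),
    (False, True, "no"),
    (False, True, "no"),
]


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gain(rows, attr_index):
    """Assumed information gain from splitting rows on one attribute column."""
    labels = [row[-1] for row in rows]
    remainder = 0.0
    for value in {row[attr_index] for row in rows}:
        # Class labels of the rows that share this attribute value.
        subset = [row[-1] for row in rows if row[attr_index] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(labels) - remainder


if __name__ == "__main__":
    print("Entropy:", entropy([row[-1] for row in ROWS]))
    for name, index in (("A1", 0), ("A2", 1)):
        print(f"Gain({name}):", gain(ROWS, index))
```

Under this assumed definition, the attribute with the larger gain would be the candidate for the root node. If the formula given in class differs (for example, a gain ratio), only the body of `gain` needs to change.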
