Data Mining Assignment #1
CSC592 – Fall ’05

Overview

You are to construct a decision tree for the given data set (tennis.arff – see course website) using the ID3 decision tree algorithm in the Weka toolset. The dependent variable for the tree is ‘play’. Once you have constructed the tree, use the tree and the data Weka provides about the tree to answer the questions below.

Questions

1. Does the tree adequately describe the data? Why or why not?
2. Use the decision tree to determine the value of the dependent variable for the following instance:

   outlook  temperature  humidity  windy  play
   rainy    hot          high      false  ?

   Justify your answer!

Instructions

Start the Weka explorer.
