Department of Mathematics, IIT Bombay
SI 515 (Data Mining): Autumn 2010
Assignment Sheet-II

Every question is a test assignment for individual groups and carries a maximum of 10 marks.

Q1. Make clusters of Fisher's iris data using (i) a suitable 0-1 integer programming formulation. You may use SAS-OR, Matlab, or R software utilities for solving the 0-1 IPP or an equivalent NLPP. Compute the accuracy (true positives and false positives for each cluster) with respect to the true class IDs indicated in the data set.

Q2. Generate 100 random points in a 2-dimensional space such that approximately 50 points lie inside a triangle formed by three given intersecting lines (choose the lines so that none of them is a coordinate axis). Clearly describe your algorithm for generating these points. Consider the combined set of all points and apply a 0-1 integer programming based clustering technique to form the clusters so that the boundaries of one of the clusters almost coincide with those of the triangle. Describe the approach used by you for
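
One possible way to carry out the point-generation step of Q2 is sketched below in Python. This is only an illustration under stated assumptions, not a prescribed solution: the three lines `L1`, `L2`, `L3`, the margin around the triangle, the exact 50/50 split, and all function names are hypothetical choices made for the example. Inside points are drawn uniformly with barycentric coordinates; outside points are drawn by rejection sampling from a box around the triangle.

```python
import random

# Assumed example lines in the form a*x + b*y = c; none is a coordinate axis.
L1 = (1.0, -1.0, 0.0)    # x - y = 0
L2 = (1.0, 1.0, 6.0)     # x + y = 6
L3 = (1.0, -5.0, -4.0)   # x - 5y = -4


def intersect(l1, l2):
    """Intersection of two lines a*x + b*y = c, by Cramer's rule."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    det = a1 * b2 - a2 * b1
    if abs(det) < 1e-12:
        raise ValueError("lines are parallel")
    return ((c1 * b2 - c2 * b1) / det, (a1 * c2 - a2 * c1) / det)


# Triangle vertices are the pairwise intersections of the three lines.
VERTICES = (intersect(L1, L2), intersect(L1, L3), intersect(L2, L3))


def in_triangle(p, a, b, c):
    """True if p lies inside or on the triangle abc (same-side test)."""
    def cross(o, u, v):
        return (u[0] - o[0]) * (v[1] - o[1]) - (u[1] - o[1]) * (v[0] - o[0])
    d = (cross(a, b, p), cross(b, c, p), cross(c, a, p))
    return not (any(x < 0 for x in d) and any(x > 0 for x in d))


def sample_inside(rng, a, b, c):
    """Uniform point in triangle abc via reflected barycentric coordinates."""
    r1, r2 = rng.random(), rng.random()
    if r1 + r2 > 1.0:          # reflect back into the triangle
        r1, r2 = 1.0 - r1, 1.0 - r2
    return (a[0] + r1 * (b[0] - a[0]) + r2 * (c[0] - a[0]),
            a[1] + r1 * (b[1] - a[1]) + r2 * (c[1] - a[1]))


def sample_outside(rng, a, b, c, margin=1.0):
    """Uniform point in the enlarged bounding box, rejected if inside abc."""
    xs, ys = (a[0], b[0], c[0]), (a[1], b[1], c[1])
    while True:
        p = (rng.uniform(min(xs) - margin, max(xs) + margin),
             rng.uniform(min(ys) - margin, max(ys) + margin))
        if not in_triangle(p, a, b, c):
            return p


def generate_points(n_total=100, n_inside=50, seed=0):
    """Return a shuffled list of (x, y, label); label 1 marks inside points."""
    rng = random.Random(seed)
    pts = [sample_inside(rng, *VERTICES) + (1,) for _ in range(n_inside)]
    pts += [sample_outside(rng, *VERTICES) + (0,)
            for _ in range(n_total - n_inside)]
    rng.shuffle(pts)
    return pts
```

Calling `generate_points()` returns the combined set of 100 labelled points. The labels are kept only so that the clusters found later can be compared with the triangle; the clustering step itself would see just the coordinates.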

## This note was uploaded on 03/13/2012 for the course STATISTICS SI406 taught by Professor Rrj during the Spring '12 term at IIT Kanpur.

