CSE 572 Data Mining, Spring 2010, Feb 8, 2010

Some sample projects

1. Using clustering techniques to mine the natural clusters of the Haiti dataset

Choose a toolkit such as WEKA, the MATLAB clustering toolbox, etc. to categorize the reports based on their text. Document the tools you consider and the rationale for your choice of implementation. Download all of the reports found at http://haiti.ushahidi.com/download/ . Use the tool you selected to cluster the reports into at least 5 clusters based on the text description included in the reports. Demonstrate and report your implementation, document your results, and compare your clusters with the categories implemented at the http://haiti.ushahidi.com/ website.

2. Classifying the Haiti dataset into given categories

Use any existing techniques, modified existing techniques, or newly developed techniques, including data preprocessing methods. We will provide the categories to the participating students. Provide an in-depth empirical and analytic report to explain the working of your classification algorithm and the implications...
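For the first project, the clustering step might look roughly like the sketch below. It is only an illustration, not the required approach: it uses plain Python instead of WEKA or MATLAB, it swaps in TF-IDF weighting with spherical k-means (cosine similarity) as one possible technique, and the `reports` list is made-up sample text standing in for the downloaded report descriptions. The function names `tokenize`, `tfidf_vectors`, `dot`, and `kmeans` are hypothetical helpers introduced here.

```python
import math
import random
from collections import Counter


def tokenize(text):
    """Lowercase, replace non-alphanumerics with spaces, drop tokens under 3 chars."""
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return [w for w in cleaned.split() if len(w) > 2]


def tfidf_vectors(docs):
    """Return one sparse, L2-normalized TF-IDF vector (a dict) per document."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Smoothed inverse document frequency.
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def dot(a, b):
    """Dot product of two sparse vectors; equals cosine similarity for unit vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def kmeans(vectors, k, iterations=20, seed=0):
    """Spherical k-means: assign each vector to the most similar centroid."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    centroids = [dict(v) for v in rng.sample(vectors, k)]
    labels = [-1] * len(vectors)
    for _ in range(iterations):
        new_labels = [
            max(range(k), key=lambda j: dot(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            total = Counter()
            for v, label in zip(vectors, labels):
                if label == j:
                    for t, w in v.items():
                        total[t] += w
            if not total:
                continue  # empty cluster: keep the previous centroid
            norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
            centroids[j] = {t: w / norm for t, w in total.items()}
    return labels


# Hypothetical sample texts; real input would be the downloaded report descriptions.
reports = [
    "Need drinking water and food near the market",
    "Water shortage reported, families need food",
    "Building collapsed, people trapped under rubble",
    "People trapped in collapsed school building",
    "Road blocked by debris, trucks cannot pass",
    "Bridge damaged and road blocked",
    "Clinic needs medical supplies and doctors",
    "Injured people need medical help at the clinic",
]
labels = kmeans(tfidf_vectors(reports), k=5)
for label, text in sorted(zip(labels, reports)):
    print(label, text)
```

To compare the result with the website's categories, one option is to tabulate, for each cluster, how many of its reports fall under each category and inspect where the two groupings disagree.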