HW3 - CS 598 and STAT 598A: Homework 3 Due: 23rd March 2010...

CS 598 and STAT 598A: Homework 3
Due: 23rd March 2010

1. Attempt as many problems as possible.
2. No points for random guessing. You have to explain your answers.
3. Mail your source code to vishy@stat.purdue.edu before the class on 23rd of March 2010. You may email a PDF of your reports or hand them to me in the class. No late submissions will be accepted!
4. Program files should be named after the problem (e.g. solution to problem 1 should be problem1.c etc). Include detailed instructions for how to run your code on a Linux machine (e.g. include makefiles, or instructions to run scripts as appropriate).

Problem 1 (7 pt)

Recall the ECML/PKDD discovery challenge 2006, which dealt with email spam detection: http://www.ecmlpkdd2006.org/challenge.html. Download an appropriate dataset or datasets (as the case may be) from the website to form a training and test set. Use word count features, as in the Naive Bayes classifier you worked with for Assignment 1. Train a logistic regression algorithm using any two optimization al-
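
As a rough illustration of the setup Problem 1 describes (logistic regression over word count features), here is a minimal sketch in Python. The problem text is cut off before it names the optimization methods, so plain batch gradient descent is used here purely as an assumption. The function names, the learning rate and epoch count, and the toy data with its three made-up vocabulary words are all hypothetical and are not taken from the challenge dataset.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def train_logistic_regression(X, y, lr=0.1, epochs=500):
    """Fit weights and bias by batch gradient descent on the mean
    negative log-likelihood. X holds word-count vectors and y holds
    0/1 labels (1 = spam). Optimizer choice is an assumption."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * d
        grad_b = 0.0
        for xi, yi in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
            err = p - yi  # derivative of the loss w.r.t. the logit
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        for j in range(d):
            w[j] -= lr * grad_w[j] / n
        b -= lr * grad_b / n
    return w, b


def predict(w, b, x):
    # Label a message as spam (1) when the predicted probability is >= 0.5.
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if sigmoid(z) >= 0.5 else 0


# Hypothetical toy data: columns are counts of the made-up vocabulary
# ["free", "meeting", "winner"]; label 1 means spam.
X_toy = [[3, 0, 2], [2, 0, 1], [0, 2, 0], [0, 3, 0], [1, 0, 1], [0, 1, 0]]
y_toy = [1, 1, 0, 0, 1, 0]

w, b = train_logistic_regression(X_toy, y_toy)
predictions = [predict(w, b, x) for x in X_toy]
```

On real data the count vectors would be sparse and much longer, so a sparse representation and a second optimizer for comparison would be natural next steps.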