4 Pages

syllabus

Course: MSCS 282, Fall 2009
School: Carnegie Mellon
Rating:
 
 
 
 
 

Word Count: 844

Document Preview

282 MSCS -- Data Mining Aug. 29, 2001 1 Syllabus MSCS 282: Data Mining Fall 2001 Call #39254 http://www.mscs.mu.edu/~cstruble/class/mscs282/fall2001 Instructor: Craig A. Struble Office: Office Hours: 386 Cudahy Hall 24 p.m. MW, and by appointment. Class Meets: 5:458:25 p.m., Wednesday, CU118 Overview "Data mining" refers to a collection of techniques for extracting...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> Carnegie Mellon >> MSCS 282

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
282 MSCS -- Data Mining Aug. 29, 2001 1 Syllabus MSCS 282: Data Mining Fall 2001 Call #39254 http://www.mscs.mu.edu/~cstruble/class/mscs282/fall2001 Instructor: Craig A. Struble Office: Office Hours: 386 Cudahy Hall 24 p.m. MW, and by appointment. Class Meets: 5:458:25 p.m., Wednesday, CU118 Overview "Data mining" refers to a collection of techniques for extracting "interesting" relationships and knowledge hidden in data. We will study a variety of these techniques and carry out practical exercises to understand what is and what is not "interesting." In the process, we will identify strengths and weaknesses of each technique. Prerequisites There are no formal prerequisites, but all students are required to have the instructor's consent. Background in at least one of databases, artificial intelligence, machine learning, pattern recognition, algorithms, or statistics will be helpful. Textbook and References Required Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, 2001, Morgan Kaufmann Publishers, ISBN 1-55860-489-8. Communications of the ACM, Special Issue on Data Mining, November 1996. Available online through the library. Communications of the ACM, Special Issue on Knowledge Discovery, November 1999. Available online through the library. Additional material will be on reserve in the library and on the course web site. MSCS 282 -- Data Mining Aug. 29, 2001 2 Grading Your grade will consist of the following components, weighted as shown: Homework Term Project 50% 50% Final letter grades will depend on the overall performance of the class. The table below shows the overall percentage you must acheive to guarantee a given letter grade. Percentage 92% 88% 82% 78% 72% 68% 60% Letter Grade A AB B BC C CD D Writing Expectations Good writing skills are essential for effective communication of your ideas. All work submitted in this class is expected to be in well written English. Poorly written work may have points deducted, worth no more than 5% of the assignment grade. I will correct or point out grammatical and spelling errors in your work. In extreme cases, the work will be returned to be rewritten and resubmitted, with a 10% penalty. If you are not confident in your ability to write English well, you should seek help from the Ott Memorial Writing Center. Visit http://www.marquette.edu/writingcenter/ for more information. Homework Assignments Homework assignments will primarily consist of discussion questions and experiments related to data mining. You will be given 12 weeks to complete a homework assignment. Homework assignments must be submitted in class the day the assignment is due and prepared electronically (e.g., using word processing software). Each assignment is graded on a 100 point scale. Term Project The term project is a group project intended to tie together and concepts techniques discussed in class. Groups may consist of up to 3 students. Your group will write a project proposal, maintain a project web site, carry out the proposed project, and submit a final project report of no more than 15 pages in length. The goal of the project is to produce a report that could be published in a regional conference about data mining or some other trade publication (e.g., Dr. Dobbs). More details about the project will be made available in lecture and on the course web site. The major milestones for the project are listed below. MSCS 282 -- Data Mining Date September 12, 2001 October 3, 2001 October 17, 2001 November 14, 2001 December 5, 2001 December 7, 2001 Aug. 29, 2001 Event Groups Selected Draft Proposal Due Final Proposal Due Progress Report Due Class Presentation Project Due 3 Attendance Policy Each student is expected to attend every lecture. If you miss two lectures, you will be penalized a half letter grade and formally warned of the possibility of receiving a WA (Withdrawal for Excessive Absences) If you miss three lectures, your final grade will be penalized a full letter grade. Missing the fourth lecture will result in being assigned a WA, which cannot be changed to a W. In addition, extreme tardiness to class may be recorded as an absence. Academic Honesty All students are expected to adhere to the standards of student conduct as described in the Community Expectations section of the student handbook. Homework assignments are intended to reflect individual effort in the course. Students may discuss homework assignments in a general way; i.e., discussing the nature of the assignment or providing clarifications. Sharing homework solutions in any f...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Carnegie Mellon - MSCS - 282
Data Mining - Fall 2001 Instructor: Craig A. Struble Homework 3Assigned: Monday, Sep. 17, 2001 Due: Wednesday, Sep. 26, 20011. [20 pts.] Question 6.2 from Han and Kamber. You should include a pseudocode description of your method as well as argume
Carnegie Mellon - MSCS - 282
Data Mining - Fall 2001 Instructor: Craig A. Struble Extra Credit 1Assigned: Monday, Nov. 5, 2001 Due: Friday, Dec. 7, 2001 Below are two extra credit problems related to Progol. A correct answer to one question will replace the lowest homework grad
Carnegie Mellon - MSCS - 282
Data Mining - Fall 2001 Instructor: Craig A. Struble Homework 5Assigned: Monday, Nov. 19, 2001 Due: Friday, Nov. 30, 2001 Suppose that you want to be able to predict when someone will be willing to wait for a table at a restaurant. You have collecte
Carnegie Mellon - MSCS - 282
Data Mining - Fall 2002 Instructor: Craig A. Struble, Ph.D. Homework 3Assigned: Thursday, Nov. 7, 2002 Due: Wednesday, Nov. 20, 20021. [20 pts.] Using the loaded die rolling example from the notes, calculate how many 6's in a row must be rolled be
Carnegie Mellon - GROUP - 282
Word Sense DisambiguationProject Proposal MSCS 282: Text Mining October 12, 2005 Kevin Indrebo Jidong Tao Marek TrawickiProblem Statement(Research Type Project)To examine, evaluate, and build upon current methods for word sense disambiguation (
Carnegie Mellon - GROUP - 282
Final Proposal for Developing an Application in Support of Counter TerrorismBy Akshat Kapoor Qin Wang William Leannah (Group 1)November 2, 2005Project AimsThis group is going to create a text mining system in support of counter terrorism. To ac
Carnegie Mellon - MSCS - 228
Data Mining - Fall 2003 Instructor: Craig A. Struble, Ph.D. Lab 3: Other Classification TechniquesAssigned: September 17, 2003 Due: September 24, 2003IntroductionIn this lab, you will investigate other classification techniques with the labor con
Carnegie Mellon - MSCS - 228
Data Mining - Fall 2003 Instructor: Craig A. Struble, Ph.D. Labs 6 & 7: Association RulesAssigned: November 10, 2003 Due: November 24, 2003IntroductionIn this lab, you will explore association rule mining and evaluate different measures for evalu
Carnegie Mellon - MSCS - 228
Data Mining - Fall 2003 Instructor: Craig A. Struble, Ph.D. Lab 2: Classification with Decision TreesAssigned: September 10, 2003 Due: September 24, 2003IntroductionClassification is one of the major data mining tasks. Although this task is accom
Carnegie Mellon - MSCS - 228
Data Mining - Fall 2003 Instructor: Craig A. Struble, Ph.D. Lab 1: Getting To Know Your DataAssigned: August 27, 2003 Due: September 24, 2003IntroductionThe first step in a data mining project is getting to know your data. In this lab, you will s
Carnegie Mellon - MSCS - 228
MSCS 228 - Data MiningAug. 25, 20031Syllabus MSCS 228: Data MiningFall 2003 Call #27049 http:/www.mscs.mu.edu/~cstruble/class/mscs228/fall2003 Instructor: Craig A. Struble, Ph.D.Office: Office Hours: Phone: Email: 369 Cudahy Hall 11:00 a.m.1:
Carnegie Mellon - MSCS - 228
Why "n-1 and not n"? By Vasant MarurThis write-up is to help clear up the ambiguity involving the calculation of the standard Deviation of the sample.IntroductionThe process of selecting a sample from a given population is to estimate the various
Carnegie Mellon - MSCS - 228
MSCS 228: Data Mining Dr. Craig StrubleSpring 2007, Final Exam closed book, closed notes, two US Letter cheat sheets, calculators OK 100 points 7 problems 8 pagesName:11. [16 pts.] Provide the most appropriate data mining task to use for each
Carnegie Mellon - MSCS - 228
MSCS 282: Data Mining Dr. Craig StrubleFall 2002, Final Exam closed book, closed notes, one US Letter cheat sheet, calculators OK 100 points 30 questionsName:1Multiple Choice [2 pts each] Circle the BEST answer. 1. Suppose you have a data set
Carnegie Mellon - GROUP - 228
Mining the Futures: Predictive Datamining using Financial DataMike Klotz Carl Twining Brian Marx Andrew Vallejos1.ABSTRACTCommodities speculation has existed for as long as there have been organized markets. One of the first uses Galileo found fo
Carnegie Mellon - GROUP - 228
The Molecular Clustering and Classification Predication of Non-Small Cell Lung Cancer based on cDNA Microarray Gene Expression ProfilingXinna Niu nxn@mcw.edu Anca Deselnicu a_deselnicu@yahoo.com Frank long birdbamboo@yahoo.com Xin (Sean) Yan seanyan
Carnegie Mellon - GROUP - 228
Marquette University Student Data MartJack SzetoMarquette University jack.szeto@marquette.eduNeelima TummalaMarquette University neelima.tummala@marquette.eduABSTRACTMarquette University creates a "freeze file" (this freeze file is created on
Carnegie Mellon - GROUP - 228
Classification of Simplified Chinese Characters from scanned sources using a Neural NetworkAaron Gember Marquette University aaron.gember@marquette.edu Elias Peterson Marquette University elias.peterson@marquette.eduIntroductionOptical Character
Carnegie Mellon - MSCS - 230
MSCS 230 - Bioinformatics IAug. 27, 20021Syllabus MSCS 230: Bioinformatics IFall 2002 Call #74614 http:/www.mscs.mu.edu/~cstruble/class/mscs230/fall2002 Instructor: Craig A. Struble, Ph.D.Office: Office Hours: Phone: Email: 386 Cudahy Hall 5:
Carnegie Mellon - COSC - 149
COSC 149 - Operating SystemsJan. 14, 20021Syllabus COSC 149: Operating SystemsSpring 2002 Call #36081 http:/www.mscs.mu.edu/~cstruble/class/cosc149/spring2002 Instructor: Craig A. StrubleOffice: Office Hours: Phone: Email: 386 Cudahy Hall 24
University of Baltimore - IDIA - 618
Introduction to Database Design workshop: March 22-23, 1999 Glossary of Database Terminology Attribute: A characteristic of an entity that we want to record or track or retrieve later; a fact about an object we're interested in. Business Rule: A stat
Carnegie Mellon - COSC - 170
COSC 170 - Compiler Construction Homework 3 - One Reg, Two Reg, Red Reg, Blue Reg Due: Not due. Practice problem for exam. Consider this simple code fragment: L0: a = b = c = r1 = call d = r1 r3 r4 a + 5 func1 r1L1: r1 = d r2 = 0 call func2 e = d +
Carnegie Mellon - HW - 170
COSC 170 - Compiler Construction Homework 2 - When Stack Frames Attack Due: 2009 Mar 30, beginning of lecture To be completed individually. 1. Consider this simple C program: int fibonacci(int *n2, int *n1, int x) { /* printf("fib(%d,%d,%d)\n", *n2,
University of Illinois, Urbana Champaign - ATMOS - 20090321
UPPER AIR CALCULATIONS AND PLOTTING (Ver 5.49-LINUX-X11)Current filename: /mnt/noaaport/nwstg/convert/09032112_upa.wxpDate: 1200Z 21 MAR 09Searching for KBNA.Searching the city database file for: KBNA .Date:1200Z 21 MAR 09Station: KBNAW
SUNY Oswego - ISC - 200801
ISC 320 Exam 2 Page 1 of 8 Name _KEY _Multiple Choice/Short Answer. Choose the single best answer. 3 points each. 1. The logic axiom "(p => q) * (r => s), p V r |= q V s" is equivalent to the CLIPS: (defrule A (p) => (q)
SUNY Oswego - ISC - 200801
ISC 320 Exam 2 Page 1 of 8 Name _ KEY _ 106 points _Multiple Choice/Short Answer. 3 points each. 1. In an AND-OR tree, if the branches from a node are joined with an arc, then the parent is assigned a true value if a) any o
CSU Bakersfield - SPANISH - 427
Spanish 427Dr. Blommers Identification and Study Guide for First Examination 1. Formation of culture. Influencing peoples: a. b. c. d. e. f. Iberians Celts Phoenicians Carthaginians Greeks Romans g. Early Christians (Visigoths and other barbarians)
CSU Bakersfield - SPANISH - 427
<?xml version="1.0" encoding="UTF-8"?> <Error><Code>NoSuchKey</Code><Message>The specified key does not exist.</Message><Key>819109eeff13a3dc1efbeda6c8b00818a173dcae.doc</Key><RequestId>3 54EA717C917EF34</RequestId><HostId>TQ2IAHy4sjPd0+qVNVyytw5z0cZ
CSU Bakersfield - SPANISH - 427
Spanish 427Dr. Blommers Identification and Study Guide for Final Examination I. Hispanic American Independence A. Causes 1. Expulsion of the Jesuits (1766) & Rise of Nationalism 2. American Revolution (1776) a. Alexander Von Humbolt Less Taxes, Bet
Caltech - PHYS - 199
%!PS-Adobe-3.0 %Pages: (atend) %BoundingBox: 89 52 524 718 %HiResBoundingBox: 89.908911 52.285791 523.371191 717.403540 %. %Creator: GNU Ghostscript 651 (pswrite) %CreationDate: 2004/01/23 00:57:50 %DocumentData: Clean7Bit %LanguageLevel: 2 %EndComme
Chester - ECO - 343
FT.com / Home UK - Steady service sector lightens eurozone gloom Friday Apr 8 2005 . All times are London time. Roger Bove Edit Profile
St. Mary MD - MATH - 322
Math 322: Algebra II Spring 2006Instructor: E-mail: Phone: Oce: Oce Hours: Course Web Page: Text: Susan Goldstinesgoldstine@smcm.edux4366 Schaefer 171 Monday, 1:30{2:30; Wednesday, 10:30{11:30; Friday 2:00{3:00http:/www.smcm.edu/users/sgoldstine
St. Mary MD - MATH - 151
Math 151: Calculus 1 Fall 2008 Section SyllabusInstructor: E-mail: Phone: Office: Office Hours: Susan Goldstine x4366 Schaefer 171 Tuesday, 2:303:30; Wednesday, 3:004:00; Friday 3:004:00 and by appointment. Drop-ins are welcome, as long as I happen
St. Mary MD - MATH - 151
MATH 151 - SYLLABUS Calculus ISection 1 2 3 5 6 Professor Susan Goldstine Susan Goldstine Richard Stark Patrick Corn Patrick Corn Day/Time MWF 1:20 MWF 9:20 MWF 10:40 MWF 1:20 MWF 10:40 Location SH 109 SH 134 SS 123 (ARC) SH 165 SH 134 Test/Review W
St. Mary MD - MATH - 281
Math 281: Foundations of MathematicsInstructor: E-mail: Phone: Oce: Oce Hours: Course Web Page: TA: E-mail:Catalog DescriptionSpring 2008 SyllabusSusan Goldstine(rhymes with "line")sgoldstine@smcm.edux4366 Schaefer 171 Monday, 3:00{4:00
St. Mary MD - MATH - 152
Math 152: Calculus 2 Spring 2008 SyllabusInstructor: E-mail: Phone: Office: Office Hours: Susan Goldstine x4366 Schaefer 171 Monday, 3:004:00 PM; Wednesday, 7:008:00 PM; Friday, 3:004:00 PM; and by appointment. Drop-ins are welcome, as long as I hap
St. Mary MD - MATH - 152
Math 152: Calculus 2 Spring 2008 SyllabusInstructor: E-mail: Phone: Office: Office Hours: Susan Goldstine x4366 Schaefer 171 Monday, 3:004:00 PM; Wednesday, 7:008:00 PM; Friday, 3:004:00 PM; and by appointment. Drop-ins are welcome, as long as I hap
Texas A&M - STAT - 652
STATISTICS 652 Instructor: Jeff Hart 1. 2. 3. 4.TEST 1Name: February 27, 2004Answer all questions. Please show the steps in your reasoning and in your calculations. You have one hour to complete this exam. You may use calculators and one formul
Washington - AA - 506
General solutions of the second-order inhomogenous ODE d2y 2 + ( ) + ( ) = ( ) (1) dx Find two solutions, y1 and y2 , of homogenous ODEd2y + ( ) + ( ) = 0 dx 2 Solution of (1) is given by ( ) ( ) ( ) ( ) 2 1 y = - + + + 1 2 1 2 ( )
Washington - AA - 506
Review of lecture 10 Chapter 2 Diffusion of concentrated vorticityImpulsivemotion of a walldiffusion2 a = 2 equationwhere1 at t=0, = 0 ( ) = 2 0 y a a , 00
Washington - AA - 506
Review of lecture 12Diffusionof a line vortexwhere2 a = ( 2 + 1 ) diffusion equation at t=0, = 0 () r a a , 0r' missed ( ) % = ( ) 0 ( )0+ 0 ( 0 ( ) ) 0 ( ) 0 00Solution1 ( )(- 2 + 2 ) ( ) =
Washington - AA - 506
Dirac's Delta function ( ) = 0 , ifxx 0 x=0 ( ) =+, ifwhere a is any negative number including 0- , b is any positive number+ ( ) ( ) = (0) , ( ) ( ) = 0where b and c are any negative numbers++0 ( ) ( ) = 1 (0)2
Wisconsin - ENGR - 462
ECE462 Medical instrumentation1webster@engr.wisc.edu9/1/99IO-2.7 Assume thermistor self heating < 0.1C Assume voltage = 5 V Dissipation coefficient = 2.0 mW/C P V 2 R T = = D .C . D .C . V 2 52 R = = = 1 2 5 ,0 0 0 T ( D . C . ) 0 .1 C ( 0
UC Riverside - CS - 165
CS165 Homework 1Oct 18, 2005Please make sure to provide full details in all your answers. You must abide by the honor code.1Q11. Given the distribution of letters in the English alphabet below, show how to estimate the entropy of English sour
Caltech - E - 060273
LASER INTERFEROMETER GRAVITATIONAL WAVE OBSERVATORYE060273 -A- DDrawing No Rev. GroupSPECIFICATIONSheet 1of2Fused Silica Blank, LASTI Compensation PlateAUTHOR: G. Billingsley CHECKED: D. Coyne, Willems DATE 12-12-06 DCN NO. E060275-00 A
Pima CC - MATH - 092
Elementary Algebra, Math 092 - Spring, 2008 Time: Tuesday, 6:00 pm - 9:00. Classroom: Community Campus, Computer Commons, Room A107 Course Number: MAT092 Course Registration Number (CRN): 22534 Instructor: Dr. Joe Erker E-mail: Joseph.Erker@pima.edu
Pima CC - MATH - 082
Basic Mathematics, Math 082 - Spring, 2008 Time: Tuesday, 6:00 pm - 9:00. Classroom: Community Campus, Computer Commons, Room A107 Course Number: MAT082 Course Registration Number (CRN): 22532 Instructor: Dr. Joe Erker E-mail: Joseph.Erker@pima.edu C
Pima CC - MATH - 092
Elementary Algebra, Math 092 - Fall, 2008 Time: Tuesday, 6:00 pm - 9:00. Classroom: Community Campus, Computer Commons, Room A107 Course Number: MAT092 Course Registration Number (CRN): 12315 Instructor: Dr. Joe Erker E-mail: Joseph.Erker@pima.edu Co
Université du Québec à Montréal - DH - 091484
Adresse des sites interessants http:/www.stephanedemers.com/dance_music_photo.asp http:/www.ozeo.com/ http:/3boom.net/3bigbang.htm http:/www.creatweb.com/moustaki/guestbook00.shtml http:/www.friendsofphotography.org/ http:/www.tokyofriends.com/ http:
Université du Québec à Montréal - DH - 091484
Diminuez vos cots de main d'oeuvre!Que ce soit sur une base permanente, temporaire ou sur appel, saviez-vous que RSI pouvait vous permettre d'conomiser jusqu' 25% sur vos cots de main d'oeuvre. Vous en avez asser des cotisations CSST, Assurance mal
Pima CC - MATH - 082
Basic Mathematics, Math 082 - Fall, 2008 Time: Tuesday, 6:00 pm - 9:00. Classroom: Community Campus, Computer Commons, Room A107 Course Number: MAT082 Course Registration Number (CRN): 12314 Instructor: Dr. Joe Erker E-mail: Joseph.Erker@pima.edu Cou
Pima CC - MATH - 086
Prealgebra, Math 086 - Fall, 2008 Time: Tuesday, 6:00 pm - 9:00. Classroom: Community Campus, Computer Commons, Room A107 Course Number: MAT086 Course Registration Number (CRN): 12316 Instructor: Dr. Joe Erker E-mail: Joseph.Erker@pima.edu Course web
Pima CC - MATH - 086
Prealgebra, Math 086 - Spring, 2008 Time: Tuesday, 6:00 pm - 9:00. Classroom: Community Campus, Computer Commons, Room A107 Course Number: MAT086 Course Registration Number (CRN): 22533 Instructor: Dr. Joe Erker E-mail: Joseph.Erker@pima.edu Course w
Laurentian - BIO - 1010
Biology 1010 Laboratory IntroductionPreface Biology 1010 is an introductory cellular biology course in the core program offered by the Department of Biological Sciences. The Biology 1010 laboratories have been designed to complement the lecture mate
Laurentian - BIO - 1010
The University of LethbridgeBIOLOGY 1010LABORATORY MANUAL Fall, 2005BIOLOGY 1010The Cellular Basis of Life Laboratory ManualFall, 2005Written by: Helena Danyk Department of Biological Sciences University of LethbridgeStudent Name: Lab Day:
Laurentian - BIO - 1010
Midterm exam 2, Biology 1010 Each exam variant 50 points. Questions 1 and 2 are 15 points Questions 3 and 4 are 10 points 1 1. Membrane structure and function. Fluidity of membranes. 2. A review of cellular respiration, main steps. Main energy input
Texas A&M - P - 202
PHYSICS 202 EXAM 4Thursday, April 22, 2004NAME: SECTION:Note: 517 518 519 520517Recitation Recitation Recitation Recitation518Mon 4:10 Wed 10:20 Wed 8:00 Mon 1:50519520There are a total of 11 problems on this test. For each problem,
Texas A&M - P - 202
PHYSICS 202 EXAM 3March 30, 2006 NAME: SECTION: Note: 517 518 519 520 517 Recitation Recitation Recitation Recitation 518 Mon 4:10 Wed 10:20 Wed 8:00 Mon 1:50 519 520There are a total of 11 problems on this test. For each problem, write your answ
SUNY Stony Brook - PHY - 277
Goals for This Lecture: Understand why Euler's method sometimes gives inaccurate answers Develop some more accurate ODE integration methods Develop general purpose ODE codes Compare these methods on the orbital problemAST/PHY 277 Lecture 31Eu
SUNY Stony Brook - AST - 443
AST 443 Research Proposal:Proposal Title: Investigators: Telescope: The Relationship Between the Optical Brightness of the Helix Nebula (NGC 7293) to its Density and Ionization Levels. Keith Godlewski, Raema Obbie, Zeng Fan R/C spectrograph on the S
SUNY Stony Brook - AST - 101
Is Pluto a Planet? And other weighty issuesFrederick M. WalterScience Club of Long Island 20 December 2006What is a Planet?Significant bodies that orbit the Sun The naked eye planets: Earth is a planet Mercury, Venus and Mars are like the Earth