Problem 1: ER Diagram Design
The company you work for wants to digitize its time cards. You have been asked to design the
database for submitting and approving time cards. Draw the database ER diagram with the following
information:
A timecard should ha
Data Mining
* New buzzword, old idea.
* Inferring new information from already collected data.
* Traditionally the job of data analysts.
* Computers have changed this.
* Far more efficient to comb through data using a machine than eyeballing statistical data.
Da
Classifying Galaxies
Clustering Definition
* Given a set of data points, each having a set of attributes, and a similarity measure
  among them, find clusters such that:
  * Data points in one cluster are more similar to one another.
  * Data points in separat
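The definition above does not name a particular algorithm or similarity measure. As a minimal sketch, assuming Euclidean distance as the (dis)similarity measure and k-means as the grouping procedure (both are illustrative choices, not taken from the slides), the idea might look like this in Python. The names `euclidean`, `kmeans`, and the sample `points` are hypothetical.

```python
import math

def euclidean(p, q):
    """Distance between two points; a smaller value means more similar."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def kmeans(points, k, iterations=10):
    """Group points into k clusters (illustrative k-means sketch)."""
    # Assumption: seed the centroids with the first k points so the run is deterministic.
    centroids = [tuple(p) for p in points[:k]]
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its most similar (nearest) centroid.
            nearest = min(range(k), key=lambda i: euclidean(p, centroids[i]))
            clusters[nearest].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[i] = tuple(sum(c) / len(members) for c in zip(*members))
    return clusters

# Hypothetical data: each point has two attributes.
points = [(1.0, 1.0), (8.0, 8.0), (1.5, 2.0), (9.0, 8.5), (1.0, 0.5), (8.5, 9.0)]
clusters = kmeans(points, k=2)
for i, members in enumerate(clusters):
    print(i, members)
```

On this sample data the points near (1, 1) should end up in one cluster and the points near (8.5, 8.5) in the other. A different similarity measure or seeding strategy could give different groupings.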
The MSD databases
The MSD actually consists of two separate databases:
* the archive database is highly normalized, with thousands of
  relationships linking some 400 tables;
* the deposition database is the
  definitive archive for all structural data at MSD
An Introduction to Data Mining
Why Data Mining
* Credit ratings/targeted marketing:
  * Given a database of 100,000 names, which persons are the least likely to
    default on their credit cards?
  * Identify likely responders to sales promotions
* Fraud detection
  * Which types of transaction
Homework 2
Note: You have to submit a hard copy of Homework 2 at the beginning of the class on the due
date (Oct. 2).
Specify the following queries in Relational Algebra, Tuple Relational Calculus, and Domain
Relational Calculus, respectively, based on the CO