CSC 5120 Assignment 3 (Fall 2009) Due Date and Time: 3:00 p.m. 3 Dec, 2009 Question 1: Consider an R-Tree with maximum node size M=4 and minimum node size m=2. Assume that the database contains the fo
CSC-5120-09
CSC-5120 Assignment 1 (Fall 2009) Due Date and Time: 3:00 p.m. 20th Oct, 2009
1.
(a) Suppose we modify the 2-Phase Commit protocol as follows. When a site s_i votes NO, it sends the messag
Model Answer for Assignment 2 1 Step 1: Find all the confident rules for a given minimum confidence. (Given a minimum confidence minconf, a rule is confident if conf(x → c) ≥ minconf, where conf(x → c) is th
Model Answer for Assignment 1 1(a)
[Figure: state-transition diagram with states q0, w0, a0 and qi, wi, ai, plus END; transition labels include Ready?0i (1 ≤ i ≤ m), Yes_i0, No_ij (0 ≤ j ≤ m), Or(No_i0), And(Yes_i0), And(Commit_0i), Or(No_ij (j ≠ i, 1 ≤ j ≤ m)), and Timeout]
CSE-5120-Fall-2009
Data Warehousing
[Figure: diagram with two nodes labeled Client]
Decision support systems (DSS) in business. Also called Online Analytical Processing (OLAP) (vs. OLTP: Online Transaction Processing). Many corporati
To reduce the number of dimensions. Eigenvalues and Eigenvectors. Karhunen-Loeve Expansion. A number λ is called an eigenvalue (or characteristic value) of an n × n matrix A if there exists
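The slide text is cut off at this point. As a minimal sketch, assuming the standard condition that some nonzero vector x satisfies A x = λ x, the following checks candidate eigenpairs. The 2 × 2 matrix and the helper names `mat_vec` and `is_eigenpair` are hypothetical and chosen only for illustration.

```python
def mat_vec(a, x):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def is_eigenpair(a, lam, x, tol=1e-9):
    """Return True if x is nonzero and A x equals lam * x within tol."""
    if all(abs(v) <= tol for v in x):
        return False  # the zero vector never counts as an eigenvector
    ax = mat_vec(a, x)
    return all(abs(ax_i - lam * x_i) <= tol for ax_i, x_i in zip(ax, x))


# Hypothetical example matrix (an assumption, not taken from the slides).
A = [[2.0, 1.0],
     [1.0, 2.0]]

print(is_eigenpair(A, 3.0, [1.0, 1.0]))    # True
print(is_eigenpair(A, 1.0, [1.0, -1.0]))   # True
print(is_eigenpair(A, 2.0, [1.0, 0.0]))    # False
```

For this symmetric matrix the two eigenvectors are orthogonal, which is the property a Karhunen-Loeve style dimension reduction would rely on when picking new axes.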
An R-tree for data points
R-Trees: Index structure for Spatial Searching
Guttman, SIGMOD 1984
[Figure: R-tree over data points, with labels A, B, D, F, G, I, J, K]
R-tree: a height-balanced tree that has some similarity to a B-tree. records
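R-tree nodes group entries under minimum bounding rectangles (MBRs). The sketch below is not a full R-tree. It only shows three rectangle operations that such an index relies on: computing an MBR, testing overlap for a search, and measuring the area enlargement needed to absorb a new point. All function names and sample points are assumptions made for this example.

```python
def mbr(points):
    """Minimum bounding rectangle (xmin, ymin, xmax, ymax) of 2-D points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))


def area(r):
    """Area of a rectangle given as (xmin, ymin, xmax, ymax)."""
    return (r[2] - r[0]) * (r[3] - r[1])


def intersects(r, s):
    """True if rectangles r and s overlap (touching edges count)."""
    return r[0] <= s[2] and s[0] <= r[2] and r[1] <= s[3] and s[1] <= r[3]


def enlargement(r, p):
    """Extra area needed for rectangle r to also cover point p."""
    grown = (min(r[0], p[0]), min(r[1], p[1]),
             max(r[2], p[0]), max(r[3], p[1]))
    return area(grown) - area(r)


# Hypothetical leaf entries, not taken from the slides.
leaf = [(1, 1), (2, 3), (4, 2)]
box = mbr(leaf)
print(box)                              # (1, 1, 4, 3)
print(intersects(box, (3, 2, 6, 6)))    # True
print(intersects(box, (5, 4, 6, 6)))    # False
print(enlargement(box, (5, 5)))         # 10
```

A search only descends into children whose rectangle intersects the query window, and an insertion typically prefers the child whose rectangle needs the least enlargement.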
Indexing Multimedia Databases
Sample queries
primary key: Find the employee record with emp = 123.
secondary key: Find the employee records with salary = 40K.
text: Find the documents
Subspace Clustering
Agrawal, Gehrke, et al., SIGMOD 98
[Figure: grid over the Age and Income axes, with units labeled u1 to u5]
Data: points in a multidimensional space. Each dimension is partitioned into intervals. Unit: in
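The definition of a unit is cut off here. As a sketch, assuming a unit is the grid cell formed by taking one interval from each dimension, the following maps points to units and keeps the units holding enough points. The interval widths, the count threshold `min_count`, and the sample (age, income) points are all hypothetical.

```python
import math
from collections import Counter


def unit_of(point, lows, widths):
    """Index of the grid cell containing point, one interval index per dimension."""
    return tuple(int(math.floor((v - lo) / w))
                 for v, lo, w in zip(point, lows, widths))


def dense_units(points, lows, widths, min_count):
    """Count points per unit and keep units with at least min_count points."""
    counts = Counter(unit_of(p, lows, widths) for p in points)
    return {u: c for u, c in counts.items() if c >= min_count}


# Hypothetical (age, income in thousands) points.
points = [(25, 30), (27, 32), (29, 38), (45, 80), (52, 31)]
lows = (20, 0)      # assumed lower bound of each dimension
widths = (10, 20)   # assumed interval width of each dimension

print(unit_of((25, 30), lows, widths))                   # (0, 1)
print(dense_units(points, lows, widths, min_count=2))    # {(0, 1): 3}
```

A fixed count is used as the density threshold for simplicity; a fraction of the total number of points would work the same way.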
Association Rule: purchase(T, bread), purchase(T, butter) → purchase(T, milk). Clustering: Group a set of data based on the conceptual clustering principle: maximize the intraclass simila
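To make the association rule concrete, here is a small sketch that computes support and confidence, reading the rule as "transactions containing bread and butter also contain milk" (the direction of the rule is an assumption, since the arrow is missing from the extracted text). The transaction list and function names are hypothetical.

```python
def count_containing(transactions, items):
    """Number of transactions that contain every item in items."""
    items = set(items)
    return sum(1 for t in transactions if items <= set(t))


def support(transactions, items):
    """Fraction of transactions that contain every item in items."""
    return count_containing(transactions, items) / len(transactions)


def confidence(transactions, lhs, rhs):
    """Among transactions containing lhs, the fraction that also contain rhs."""
    n_lhs = count_containing(transactions, lhs)
    if n_lhs == 0:
        return 0.0
    return count_containing(transactions, set(lhs) | set(rhs)) / n_lhs


# Hypothetical market-basket data.
transactions = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"milk"},
]

print(support(transactions, {"bread", "butter", "milk"}))               # 0.4
print(round(confidence(transactions, {"bread", "butter"}, {"milk"}), 3))  # 0.667
```

Three transactions contain both bread and butter, and two of those also contain milk, so the rule holds with confidence 2/3.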
REPLICATED DATA
To guarantee conflict serializability: 2-Phase Locking
Multiple copies of some data items are stored at multiple sites. One copy serializability: Multiple copies of an
accesses an account from a site different from the initiation site or accesses accounts in several different sites
Distributed Databases
Chapter 18(19) of Book: Database Systems Concep
CSC-5120 Assignment 2 (Fall 2009) Due Date and Time: 3:00 p.m. 12th Nov, 2009
1. In a classification problem we are given a relational table with a set of attributes a1, ..., am where a1 is