Each ai corresponds to an attribute: Problem Definition


CS480 Principles of Data Management, Spring 2013 (2/19/13)
Sangmi Lee Pallickara, CS480, Spring 2012

•  Intra-source duplicates
   –  Require a data fusion process to produce a new and enhanced single representation
•  Inter-source duplicates
   –  Require less data fusion processing
   –  Require annotations on the data

[Table residue from the slide: an Email column with the values null, john@doe.com, null]

Defining Duplicates

•  Given a relation R, the schema of the relation is SR = <a1, a2, …, an>.
   –  Each ai corresponds to an attribute.

Problem Definition

•  A record stored in a relation R assigns a value to each attribute.
   r = <v1, v2, …, vn>: a simple list of values
   r = <(a1, v1), (a2, v2), …, (an, vn)>: attribute-value pairs

Example ...
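The two record notations in the problem definition can be illustrated with a minimal Python sketch. The attribute names in `SCHEMA`, the name "John Doe", and the helper functions `to_pairs` and `to_values` are assumptions made for illustration and do not come from the slides; only the email value john@doe.com and the use of null appear in the source. `None` stands in for null.

```python
from typing import List, Optional, Sequence, Tuple

Value = Optional[str]  # None plays the role of a null value

# Hypothetical schema S_R = <a1, a2, a3>; these attribute names are illustrative.
SCHEMA: Tuple[str, ...] = ("name", "phone", "email")


def to_pairs(schema: Sequence[str], values: Sequence[Value]) -> List[Tuple[str, Value]]:
    """Convert r = <v1, ..., vn> into r = <(a1, v1), ..., (an, vn)>."""
    if len(schema) != len(values):
        # A record must assign exactly one value to each attribute.
        raise ValueError("record length does not match schema length")
    return list(zip(schema, values))


def to_values(pairs: Sequence[Tuple[str, Value]]) -> Tuple[Value, ...]:
    """Convert attribute-value pairs back into the simple list of values."""
    return tuple(value for _, value in pairs)


# A record in the simple list form; the phone value is null.
record: Tuple[Value, ...] = ("John Doe", None, "john@doe.com")

# The same record as attribute-value pairs.
pairs = to_pairs(SCHEMA, record)
```

Both forms carry the same information as long as the schema order is fixed. The pair form is easier to work with when records from different sources list their attributes in different orders, which is the situation that inter-source duplicate handling has to deal with.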

This note was uploaded on 02/11/2014 for the course CS 480 taught by Professor Staff during the Spring '08 term at Colorado State.
