cis6930fa11_Potter's_Wheel

cis6930fa11_Potter's_Wheel - PottersWheel: An Interactive...

Info iconThis preview shows pages 1–5. Sign up to view the full content.

View Full Document Right Arrow Icon
Potter’sWheel: An Interactive Data Cleaning System Vijayshankar Raman and Joseph M. Hellerstein University of California at Berkeley
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Overview Current approaches to Data Cleaning Problems in current approaches Potter’s Wheel Approach Potter’s Wheel Architecture Extensible Discrepancy Detection Interactive Transformations Supported Transforms
Background image of page 2
Current Approaches to Data Cleaning Data Cleaning has three Components: Auditing data for finding discrepancies Choosing transformations to fix these, and Applying these transformations on the dataset . Many commercial solutions available Auditing tools Unitech Systems’ ACR/Data Evoke Software’s Migration Architect Transformation tools ETL tools like Data Junction or DataStage by Ascential Software.
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Problems in the current approach Lack of Interactivity Done as a batch process causing long delays User has no idea whether transformations are effective or not. Decoupling transformations and discrepancy detection.
Background image of page 4
Image of page 5
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 11/09/2011 for the course CIS 6930 taught by Professor Staff during the Fall '08 term at University of Florida.

Page1 / 15

cis6930fa11_Potter's_Wheel - PottersWheel: An Interactive...

This preview shows document pages 1 - 5. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online