DataIntegration - Distributed & Parallel DBMS...

Distributed & Parallel DBMS
M. Tamer Özsu

Outline
  - Introduction
  - Distributed DBMS Architecture
  - Distributed Database Design
  - Distributed Query Processing
  - Distributed Transaction Management
  - Data Replication
  - Parallel Database Systems
  - Data Integration Systems
  - Schema Matching
  - Schema Integration
  - Schema Mapping
  - Web Search/Querying
  - Peer-to-Peer Data Management
  - Data Stream Management

Problem Definition
  - Given existing databases with their Local Conceptual Schemas (LCSs), how to integrate the LCSs into a Global Conceptual Schema (GCS)
      - GCS is also called mediated schema
  - Bottom-up design process

Integration Alternatives
  - Physical integration
      - Source databases integrated and the integrated database is materialized
      - Data warehouses, Enterprise Application Integration (EAI)
  - Logical integration
      - Global conceptual schema is virtual and not materialized
      - Enterprise Information Integration (EII)

Methodology

GCS/LCS Relationship
  - Local-as-view
      - The GCS definition is assumed to exist, and each LCS is treated as a view definition over it
  - Global-as-view
      - The GCS is defined as a set of views over the LCSs

Database Integration Issues
  - Schema translation
      - Component database schemas translated to a common intermediate canonical representation
  - Schema creation
      - Intermediate schemas are used to create a global conceptual schema

Schema Translation
  - What is the canonical data model?
      - Relational
      - Entity-relationship: DIKE
      - Object-oriented: ARTEMIS
      - Graph-oriented: DIPE, TranScm, COMA, Cupid
          - Preferable with emergence of XML
          - No common graph formalism
  - Mapping algorithms
      - These are well-known

Schema Creation
  - Schema matching
      - Finding the correspondences between multiple schemas
  - Schema integration
      - Creation of the GCS (or mediated schema) using the correspondences
  - Schema mapping
      - How to map data from local databases to the GCS
  - Important: sometimes the GCS is defined first, and schema matching and schema mapping are done against this target GCS
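
The schema matching step above (finding correspondences between multiple schemas) can be illustrated with a minimal sketch. It is not a method taken from the slides: it assumes a purely name-based matcher built on Python's standard `difflib.SequenceMatcher`, and the two attribute lists, the `0.7` threshold, and the helper names `normalize`, `name_similarity`, and `match_schemas` are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lower-case an attribute name and drop separators ('Emp_No' -> 'empno')."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def name_similarity(a: str, b: str) -> float:
    """Purely syntactic similarity in [0, 1] between two attribute names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_schemas(lcs_a, lcs_b, threshold=0.7):
    """Return candidate correspondences as (attr_a, attr_b, score) tuples.

    Each attribute of lcs_a is paired with its best-scoring attribute in
    lcs_b; the pair is kept only if the score reaches the threshold
    (an assumed cut-off, not a recommended value).
    """
    correspondences = []
    for a in lcs_a:
        best = max(lcs_b, key=lambda b: name_similarity(a, b))
        score = name_similarity(a, best)
        if score >= threshold:
            correspondences.append((a, best, round(score, 2)))
    return correspondences


# Hypothetical local conceptual schemas, reduced to attribute names.
EMPLOYEE_DB1 = ["emp_no", "emp_name", "title", "salary"]
EMPLOYEE_DB2 = ["EmpNo", "EmployeeName", "JobTitle", "Pay"]

if __name__ == "__main__":
    for a, b, score in match_schemas(EMPLOYEE_DB1, EMPLOYEE_DB2):
        print(f"{a} <-> {b} (similarity {score})")
```

In this sketch `salary` and `Pay` are not reported as a correspondence, because their names share almost no characters even though they plausibly denote the same concept. Name-based matching alone therefore tends to miss synonyms, which is one reason the resulting correspondences are usually treated as candidates to be reviewed before schema integration and schema mapping.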