s10-integration

s10-integration - Kambhampati & Information...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Kambhampati & Information Integration on the Web 1 Information Integration 4/5 Kambhampati & Information Integration on the Web 2 Information Integration on the Web AAAI Tutorial (SA2) Rao Kambhampati & Craig Knoblock Monday July 22nd 2007. 9am-1pm S l i d e s f o r P a r t s 1 a n d 5 a r e A v a i l a b l e i n h a r d c o p y a t t h e F r o n t o f t h e r o o m Kambhampati & Information Integration on the Web 3 Overview Motivation & Models for Information Integration [30 ] Models for integration Semantic Web Getting Data into structured format [30] Wrapper Construction Information Extraction Getting Sources into alignment [30] Schema Mapping Source Modeling Getting Data into alignment [30] Blocking Record Linkage Processing Queries [45] Autonomous sources; data uncertainty.. Plan Execution Wrapup [15] Q u e r y Q u e r y Services Webpages Structured data Sensors (streaming Data) Services Webpages Structured data Sensors (streaming Data) Executor Needs to handle Source/network Interruptions, Runtime uncertainity, replanning Source Fusion/ Query Planning Needs to handle: Multiple objectives, Service composition, Source quality & overlap Source Trust Ontologies; Source/Service Descriptions R e p l a n n i n g R e q u e s t s P r e f e r e n c e / U t i l i t y M o d e l Answers Probing Queries S o u r c e C a l l s Monitor U p d a t i n g S t a t i s t i c s Executor Needs to handle Source/network Interruptions, Runtime uncertainity, replanning Source Fusion/ Query Planning Needs to handle: Multiple objectives, Service composition, Source quality & overlap Source Trust Ontologies; Source/Service Descriptions R e p l a n n i n g R e q u e s t s P r e f e r e n c e / U t i l i t y M o d e l Answers Probing Queries S o u r c e C a l l s Monitor U p d a t i n g S t a t i s t i c s Kambhampati & Information Integration on the Web 4 Information Integration Combining information from multiple autonomous information sources And answering queries using the combined information Many Applications WWW: Comparison shopping Portals integrating data from multiple sources B2B, electronic marketplaces Mashups, service composion Science informatics Integrating genomic data, geographic data, archaeological data, astro-physical data etc. Enterprise data integration An average company has 49 different databases and spends 35% of its IT dollars on integration efforts Deployed information integration systems Travel sites: Kayak, Expedia etc.. Google Base, DBPedia Map Mashups Libra; Citeseer; Google-squared etc Kambhampati & Information Integration on the Web 5 Kambhampati & Information Integration on the Web 6 Deep Web as a Motivation for II The surface web of crawlable pages is only a part of the overall web....
View Full Document

This note was uploaded on 03/11/2012 for the course CSE 494 taught by Professor Rao during the Spring '08 term at ASU.

Page1 / 95

s10-integration - Kambhampati & Information...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online