13DB&IR

13DB&IR - DB/IR/IE CS273 Data and Knowledge Bases...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon
DB/IR/IE S273 ata and Knowledge Bases CS273 - Data and Knowledge Bases Xifeng Yan Computer Science niversity of California at Santa Barbara University of California at Santa Barbara
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Department of Computer Science Database vs Information Retrieval Discussion Strength Weakness Data and Knowledge Bases | University of California at Santa Barbara 2
Background image of page 2
Department of Computer Science DB and IR: Two Parallel Universes atabase Systems Information Retrieval Database Systems canonical application: accounting libraries data type: numbers, short strings text undation: lgebraic / robabilistic / foundation: algebraic / logic based probabilistic / statistics based search Boolean retrieval ranked retrieval paradigm: (exact queries, result sets/bags) (vague queries, result lists) market Oracle, IBM DB2, Google, Yahoo!, MSN, Data and Knowledge Bases | University of California at Santa Barbara 3 leaders: MS SQL Server, etc. Verity, Fast, etc. Slides adopted from Weikum
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Department of Computer Science Why DB&IR Now? – Application Needs • Global health-care management for monitoring epidemics ews archives r journalists press agencies etc Simplify life for application areas like: News archives for journalists, press agencies, etc. Product catalogs for houses, cars, vacation places, etc. Customer support & CRM in insurances, telcom, retail, software, etc. • Bulletin boards for social communities Enterprise search for projects, skills, know-how, etc. Personalized & collaborative earch in digital libraries, Web, etc. e so a ed & co abo at e sea c d g ta b a es, eb, etc • Comprehensive archive of blogs with time-travel search Typical data: isease (DId Name ategory Pathogen ) MLS ategories ( ) Disease (DId, Name, Category , Pathogen …) UMLS-Categories ( … ) Patient (… Age, HId, Date, Report , TreatedDId) Hospital (HId, Address …) Typical query: ymptoms of wine flue nd ported anomalies Data and Knowledge Bases | University of California at Santa Barbara 4 symptoms of swine flue and reported anomalies with young patients in central Europe in the last two weeks
Background image of page 4
Department of Computer Science nstructured Keyword Search on Unstructured search (keywords) IR Systems Search Engines Relational Graphs (IIT Bombay, UCSD, MSR, Hebrew U, CU Hong Kong, Duke U, . ..) uerying entities & Structured search (SQL,XQuery) DB Systems Querying entities & relations from IE (MSR Beijing, UW Seattle, IBM Almaden, UIUC, MPI, … ) Platform desiderata (from app developer‘s viewpoint): Structured data (records) Unstructured data (documents) • Flexible ranking on text, categorical, numerical attributes • cope with „too many answers“ and „no answers“ Ontologies (dimensions, facets) for products, locations, org‘s, etc. • for query rewriting (relaxation, strengthening) Complex queries combining text & structured attributes Path/XQuery Full- ext with ranking Data and Knowledge Bases | University of California at Santa Barbara 5 • High update rate concurrently with high query load XPath/XQuery Full Text with ranking
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Department of Computer Science Why DB&IR Forever?
Background image of page 6
Image of page 7
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 01/09/2012 for the course CS CS273 taught by Professor Xifengyan during the Spring '11 term at UCSB.

Page1 / 25

13DB&IR - DB/IR/IE CS273 Data and Knowledge Bases...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online