cis6930fa11_DBLife

cis6930fa11_DBLife - Advancing Collaboration Efforts by...

Info iconThis preview shows pages 1–12. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Advancing Collaboration Efforts by Database Community Presented By Gautam S. Thakur Main Idea Collecting Information is relevant to a group Using specialized database tools and techniques Processing raw data and presenting in a structure form to run queries Sharing and specializing. Agenda DBLife: A Community Information Management Platform for the Database Research Community Computational Journalism A Call to Arms to Database Researchers DBLife: A Community Information Management Platform for the Database Research Community Authors Pedro DeRose Warren Shen Fei Chen Yoonkyong Lee Doug Burdick AnHai Doan Raghu Ramakrishnan Goals and Challenges Community members often want to Discover, monitor, and query entities and relationships in their community. Involves retrieving raw, largely unstructured data from multiple sources (e.g., home pages, DBLP, mailing lists), inferring and monitoring semantic information. Problem Community Information Management Web is simply too big. Difficult to build infrastructures and user bases at the Web scale to uncover more interesting problems and better validate solutions. Examples J. Gray and Jim Gray are same Recognizing a relationship (e.g., co-authoring, advising) Inferring that a relationship has ceased to exist. Solution Cimple Project - joint project between the University of Wisconsin and Yahoo! Research Develop a software platform Data-rich online community can quickly deploy and customize to effectively manage its data. Domains AILife, IRLife etc. Cimples Three Step Approach Start with High Quality Seed Relevant data sources and domain knowledge Entities and relationships of interest Automatic methods to create and maintain an entity-relationship graph of the community. Providing valuable, carefully crafted functionalities helps correct, maintain, and evolve relationships and this ER graph. Generally Speaking Apply database technologies to manage Web data. Circumvent problem by focusing on Web communities, which are in effect mini- Webs. The DBLife System dblife.cs.wisc.edu 1.5 years Nearly 900 data sources, downloads 9,500 pages, or 150+ MB, daily Tracks roughly 335,000 mentions of 16,600 entities DBLife at a SIGMOD-06, CIDR-07. DBLife at a SIGMOD-06, CIDR-07....
View Full Document

This note was uploaded on 11/09/2011 for the course CIS 6930 taught by Professor Staff during the Fall '08 term at University of Florida.

Page1 / 43

cis6930fa11_DBLife - Advancing Collaboration Efforts by...

This preview shows document pages 1 - 12. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online