VirtualDatabases-1

VirtualDatabases-1 - CS345 DataMining VirtualDatabases...

Info iconThis preview shows pages 1–6. Sign up to view the full content.

View Full Document Right Arrow Icon
    CS345 Data Mining Virtual Databases
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Example Find marketing manager openings in  Internet companies so that my commute is  shorter than 10 miles.  Web Structured queries e.g., in SQL Virtual Relations
Background image of page 2
Applications Comparison shopping shopping.com, fatlens, mobissimo,… Job search indeed.com, simplyhired,… Classifieds Search oodle Integrating web data with relational  enterprise apps purchasing, pricing,…
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Wrappers Extract tuples from a single website Assume website is a static collection of  pages i.e., no forms Website Wrapper Relation
Background image of page 4
Why can’t we use DIPRE or Snowball? Can’t assume that the same tuple can be  found on many different websites Need to extract  all  the tuples from each  website May need to normalize data values across  websites Data may be behind forms Need to account for  query capabilities  of  websites
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 6
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 15

VirtualDatabases-1 - CS345 DataMining VirtualDatabases...

This preview shows document pages 1 - 6. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online