12-LinkModel


Link Model
CS273 - Data and Knowledge Bases
Xifeng Yan
Computer Science, University of California at Santa Barbara
Department of Computer Science

[Announcement]

[Readings]
Kleinberg, Jon (1999). "Authoritative sources in a hyperlinked environment". Journal of the ACM 46 (5): 604–632.
Page, Lawrence; Brin, Sergey; Motwani, Rajeev; Winograd, Terry (1999). "The PageRank citation ranking: Bringing order to the Web".

Data and Knowledge Bases | University of California at Santa Barbara
Web search engines
- Index-based: search the Web, index Web pages, and build and store huge keyword-based indices
- Help locate sets of Web pages containing certain keywords
- Deficiencies
  - A topic of any breadth may easily contain hundreds of thousands of documents
  - Many documents that are highly relevant to a topic may not contain keywords defining them
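As a rough illustration of the index-based approach, here is a minimal sketch of a keyword (inverted) index in Python. The toy corpus, page ids, and the function names build_index and search are assumptions made up for this example; a real search engine index is far more elaborate.

```python
from collections import defaultdict


def build_index(pages):
    """Map each lowercase keyword to the set of page ids containing it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for word in text.lower().split():
            index[word].add(page_id)
    return index


def search(index, query):
    """Return the ids of pages that contain every keyword in the query."""
    words = query.lower().split()
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical toy corpus (not from the slides).
pages = {
    "p1": "honda builds cars and motorcycles",
    "p2": "list of car makers honda toyota ford",
    "p3": "toyota official site",
}
index = build_index(pages)
matches = search(index, "car makers")
```

In this toy corpus only p2 matches the query "car makers", even though p1 and p3 belong to car makers themselves. That is the second deficiency listed above: a relevant page may not contain the keywords that define the topic.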
HITS: hyperlink-induced topic search
- The Web consists not only of pages, but also of hyperlinks pointing from one page to another
- These hyperlinks contain an enormous amount of latent human annotation
- Hyperlinks can be used to infer the notion of authority: pages that are not only relevant, but also of high quality, or authoritative on the topic
- A hyperlink pointing to another Web page can be considered the author's endorsement of that page
Mining the Web's Link Structures
- Direct count of links to the pages… does not work:
  - Not every hyperlink represents an endorsement
    - Other purposes are navigation or paid advertisements
    - If the majority of hyperlinks are for endorsement, the collective opinion will still dominate
  - One authority will seldom have its Web page point to its rival authorities in the same field
  - Authoritative pages are seldom particularly descriptive
  - It is difficult to distinguish between authorities and "universally popular" pages (high in-degree for any query)
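To make the last point concrete, the following sketch counts in-links directly. The link list and page names ("portal", "expert", and so on) are hypothetical, chosen only to show how a universally popular page can outrank a topical authority under a raw count.

```python
from collections import Counter


def in_degree(links):
    """Count incoming links per page from (source, target) pairs."""
    return Counter(target for _source, target in links)


# Hypothetical link graph: "portal" is linked from everywhere
# (navigation, advertising), "expert" only from topical pages.
links = [
    ("blog1", "portal"), ("blog2", "portal"), ("shop", "portal"),
    ("blog1", "expert"), ("blog2", "expert"),
]
counts = in_degree(links)
top_page = counts.most_common(1)[0][0]
```

Here the raw count puts "portal" (3 in-links) ahead of "expert" (2 in-links), regardless of the query topic.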
HITS: Capturing Authorities & Hubs [Kleinberg 98]
- Intuitions
  - Pages that are widely cited are good authorities
  - Pages that cite many other pages are good hubs
- The key idea of HITS (Hypertext-Induced Topic Search)
  - Good authorities are cited by good hubs
  - Good hubs point to good authorities
  - Iterative reinforcement…
- Many applications in graph/network analysis
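The mutual reinforcement between hubs and authorities can be sketched as a simple iteration. This is a minimal illustration, not the lecture's own code: it assumes the link graph is already given as (source, target) pairs, skips the query-specific step of collecting the pages to analyze, and uses a fixed iteration count and L2 normalization as convenient choices. The names hits and normalize and the toy graph are made up for the example.

```python
import math


def normalize(scores):
    """Scale scores to unit L2 norm (left unchanged if all are zero)."""
    norm = math.sqrt(sum(v * v for v in scores.values()))
    if norm == 0:
        return dict(scores)
    return {k: v / norm for k, v in scores.items()}


def hits(links, iterations=50):
    """Return (authority, hub) score dicts for a list of (source, target) links."""
    links = list(links)
    nodes = {n for edge in links for n in edge}
    auth = {n: 1.0 for n in nodes}
    hub = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority score: sum of hub scores of the pages linking to it.
        new_auth = {n: 0.0 for n in nodes}
        for source, target in links:
            new_auth[target] += hub[source]
        # Hub score: sum of authority scores of the pages it links to.
        new_hub = {n: 0.0 for n in nodes}
        for source, target in links:
            new_hub[source] += new_auth[target]
        auth = normalize(new_auth)
        hub = normalize(new_hub)
    return auth, hub


# Hypothetical graph: three hub-like pages pointing at two authorities.
links = [
    ("h1", "a1"), ("h1", "a2"),
    ("h2", "a1"), ("h2", "a2"),
    ("h3", "a1"),
]
auth, hub = hits(links)
```

On this toy graph a1 ends up with a higher authority score than a2 because one more hub points to it, and h1 and h2 score higher as hubs than h3 because they point to both authorities.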
HITS (Hyperlink-Induced Topic Search)

This note was uploaded on 01/09/2012 for the course CS273 taught by Professor Xifeng Yan during the Spring '11 term at UCSB.
