Web-Page-Summarization

Web-Page-Summarization - CSE 450 Web Mining Seminar CSE W b...

Info iconThis preview shows pages 1–5. Sign up to view the full content.

View Full Document Right Arrow Icon
SE Wb Mi i Si CSE 450 Web Mining Seminar Jian Wang
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Introduction y Extract based generic Web page summarization y To utilize extra knowledge to improve Web page mmari ation ie clickthrough dataset summarization, i.e., clickthrough dataset y To build a thematic lexicon for web pages that have no associate query words
Background image of page 2
E pirical Study on Clickthrough Data mpirical Study on Clickthrough ata y The collection of queries is supposed to well reflect the topics of the target Web page. y Whether the query words are related with the topics of the eb page. Web page. y 45.5% of keyword occurs in the query words y 13.1% of query words appear as keyword. y To prove clickthrough data is helpful to summarize Web ages pages. y 58% > 71.3% y 1.48 > 2.0
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Adapted Web Page Summarization Methods . dapted Significant Word (ASW) Method 1. Adapted Significant Word (ASW) Method y Each sentence is assigned a significance factor g g y Significant words are selected according to word frequency
Background image of page 4
Image of page 5
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 08/06/2008 for the course CSE 450 taught by Professor Davison during the Spring '08 term at Lehigh University .

Page1 / 17

Web-Page-Summarization - CSE 450 Web Mining Seminar CSE W b...

This preview shows document pages 1 - 5. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online