web10 - Web as Network A Case Study Networked Life CIS 112...

Web as Network: A Case Study Networked Life CIS 112 Spring 2010 Prof. Michael Kearns
The Web as Network
Consider the web as a network
  vertices: individual (HTML) pages
  edges: hyperlinks between pages
  will view as both a directed and undirected graph
What is the structure of this network?
  connected components
  degree distributions
  etc.
What does it say about the people building and using it?
  page and link generation
  visitation statistics
What are the algorithmic consequences?
  web search
  community identification
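To make the graph model on this slide concrete, here is a minimal Python sketch. The page names and the `links` dictionary are invented toy data, and the function names are illustrative rather than anything from the lecture. It stores hyperlinks as a directed adjacency list, derives the undirected view, and tabulates a degree distribution.

```python
from collections import Counter

# Hypothetical toy data: each page maps to the pages it links to.
links = {
    "a.html": ["b.html", "c.html"],
    "b.html": ["c.html"],
    "c.html": ["a.html"],
    "d.html": ["c.html"],
}

def out_degrees(links):
    """Number of distinct outgoing hyperlinks per page."""
    return {page: len(set(targets)) for page, targets in links.items()}

def in_degrees(links):
    """Number of distinct pages linking to each page."""
    counts = Counter({page: 0 for page in links})
    for page, targets in links.items():
        for target in set(targets):
            counts[target] += 1
    return dict(counts)

def undirected_view(links):
    """Forget link direction: each hyperlink becomes a two-way edge."""
    adj = {page: set() for page in links}
    for page, targets in links.items():
        for target in targets:
            adj[page].add(target)
            adj.setdefault(target, set()).add(page)
    return adj

def degree_distribution(degrees):
    """Map each degree value to the number of pages having it."""
    return dict(Counter(degrees.values()))
```

In this toy graph `c.html` has in-degree 3 while `d.html` has in-degree 0, so `degree_distribution(in_degrees(links))` counts one page at each of those extremes and two pages with in-degree 1.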
Graph Structure in the Web [Broder et al. paper]
Report on the results of two massive “web crawls”
Executed by AltaVista in May and October 1999
Details of the crawls:
  automated script following hyperlinks (URLs) from pages found
  large set of starting points collected over time
  crawl implemented as breadth-first search
  have to deal with webspam, infinite paths, timeouts, duplicates, etc.
May ’99 crawl: 200 million pages, 1.5 billion links
Oct ’99 crawl: 271 million pages, 2.1 billion links
Unaudited, self-reported Sep ’03 stats: 3 major search engines claim > 3 billion pages indexed
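A breadth-first crawl of the kind described on this slide might look roughly like the sketch below. It is not the AltaVista crawler. The `fetch_links` callback, the `max_pages` cap, and the toy link table are assumptions made for illustration. The `seen` set handles duplicates, and the page cap is a crude stand-in for the infinite-path and timeout safeguards a real crawler needs.

```python
from collections import deque

def bfs_crawl(fetch_links, seeds, max_pages=1000):
    """Visit pages in breadth-first order starting from the seed URLs.

    fetch_links(url) is a hypothetical callback returning the URLs that
    a page links to; a real crawler would download and parse the page.
    """
    unique_seeds = list(dict.fromkeys(seeds))  # drop repeated seeds, keep order
    seen = set(unique_seeds)                   # duplicate detection
    queue = deque(unique_seeds)
    order = []
    while queue and len(order) < max_pages:    # cap guards against endless crawls
        url = queue.popleft()
        order.append(url)
        for target in fetch_links(url):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return order

# Toy "web" used in place of real HTTP requests (invented data).
toy_web = {"s": ["a", "b"], "a": ["c", "s"], "b": ["c"], "c": ["a"]}

def toy_fetch(url):
    return toy_web.get(url, [])

crawl_order = bfs_crawl(toy_fetch, ["s"])
```

As a quick check on the crawl statistics above, 1.5 billion links over 200 million pages is 7.5 links per page, and 2.1 billion over 271 million is roughly 7.7.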
Five Easy Pieces
Authors did two kinds of breadth-first search:
  ignoring link direction → weak connectivity
  only following forward links → strong
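The distinction between the two kinds of search can be illustrated with a small sketch. This is not the authors' procedure. It relies on a standard graph fact: a node's strongly connected component is the set of nodes it can reach by forward links that can also reach it, that is, the intersection of its forward-reachable and backward-reachable sets. The graph and all function names are invented for the example.

```python
from collections import deque

def reachable(adj, start):
    """Breadth-first search: every node reachable from start via adj."""
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adj.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def reverse(adj):
    """Flip every link so the search follows links backward."""
    rev = {node: [] for node in adj}
    for node, targets in adj.items():
        for target in targets:
            rev.setdefault(target, []).append(node)
    return rev

def undirected(adj):
    """Ignore link direction entirely."""
    und = {node: set() for node in adj}
    for node, targets in adj.items():
        for target in targets:
            und[node].add(target)
            und.setdefault(target, set()).add(node)
    return und

def weak_component(adj, start):
    """Nodes connected to start when link direction is ignored."""
    return reachable(undirected(adj), start)

def strong_component(adj, start):
    """Nodes that start can reach and that can also reach start."""
    return reachable(adj, start) & reachable(reverse(adj), start)

# Invented toy graph: a -> b -> c -> a is a cycle, d links in, e is linked out to.
toy_links = {"a": ["b"], "b": ["c"], "c": ["a", "e"], "d": ["a"], "e": []}
```

Here all five nodes are weakly connected, but only `a`, `b`, and `c` are strongly connected: `d` can reach the cycle without being reachable from it, and `e` is reachable from the cycle without linking back.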