WEB MINING2 - WEB MINING ABSTRACT: The web perhaps the...

Info iconThis preview shows pages 1–5. Sign up to view the full content.

View Full Document Right Arrow Icon
WEB MINING   ABSTRACT:
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
The web perhaps the single largest data source in the world. and of course because of its huge amount of information available online, the world wide web is a fertile area for web mining research. the web mining research is at the crossroad of research from several research commities,such as data base, information retrieval and with in AI, especially the sub areas of machine learning and natural language processing. However there Is a lot of confusions when comparing research efforts from different point of views . so ,web mining has seen rapid increase in interest,from both the research and practice communities. to elaborate this in our paper we present a brief introduction of web mining with the necessity of it and gloals of it.later we discuss various types of web mining such as web content mining, web usage mining and web structured mining along with their applications . And then we moved on to various applications ,limitations and disadvantages of web mining .finally we conclude by mentioning some future directions. CONTENTS: Intro Tasks Types Content Stru Usage appli adv dis conclu vth future reference
Background image of page 2
INTRODUCTION: WHAT AND WHY DATA MINING: Web mining is the application of data mining techniques to discover and better serve the needs of web-based applications. Simply to say web mining is datamining techniques applied to WWW. DATA MINING WHY DATA MINING?   The Web—an immense and dynamic collection of pages that includes countless hyperlinks and huge volumes of access and usage information—provides a rich and unprecedented data mining source.However, the Web also poses several challenges toeffective resource and knowledge discovery:
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Web page complexity far exceeds the complexity of any traditional text document collection. Although the Web functions as a huge digital library, the pages themselves lack a uniform structure and contain far more authoring style and content variations than any set ofbooks or traditional text-based documents. Moreover, the tremendous number documentsinthisdigital library have not been indexed, which makes searching the data it contains extremely difficult. The Web constitutes a highly dynamic information source. Not only does the Web continue to grow rapidly, the information it holds also receives constant updates. News, stock market, service center, and corporate sites revise their Web pages regularly. Linkage information and access records also undergo frequent updates. The Web serves a broad spectrum of user communities. The Internet’s rapidly expanding user community connects millions of workstations. These users have markedly different backgrounds, interests, and usage purposes. Many lack good knowledge of the information network’s structure, are unaware of a particular search’s heavy cost, frequently get lost within the Web’s ocean of information, and can chafe at the many access hops and lengthy waits required to retrieve search results. Only a small portion of the Web’s pages contain truly relevant or useful information.
Background image of page 4
Image of page 5
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 7

WEB MINING2 - WEB MINING ABSTRACT: The web perhaps the...

This preview shows document pages 1 - 5. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online