Chapter 13: Web Mining: Creating Structure Out of Chaos


Roderick L. Lee
Pennsylvania State University at Harrisburg, USA

Copyright © 2003, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited.

Abstract

This chapter presents an overview of Web mining. The three areas of Web mining (Web content mining, Web usage mining, and Web structure mining) are identified. Specific attention is paid to Web structure mining, which is the study of the link topology. The link topology of the Web is analyzed in the context of a cyber-community in order to explore the connection between the link topology and the conferral of authority. Millions, soon to be billions, of people are annotating Web documents, which results in an abundance of information. Herein lies the problem of topic distillation: searching through the sea of documents for relevant information. To address the problems of overabundance and relevancy, models are needed that can assist in creating order at the local level. The hub-and-spoke model identified in this chapter takes a proactive approach to creating an online community in a centralized or planned fashion and provides control over the architecture of the Web graph. In the end, users can have a certain level of confidence that the Web content contained in a hyperlinked community is both accurate and relevant.




Managing Data Mining Technologies in Organizations: Techniques and Applications
ISBN: 1591400570
Year: 2003
Pages: 174
