DocumentCode
1592591
Title
Weighted PageRank algorithm
Author
Xing, Wenpu ; Ghorbani, Ali
Author_Institution
Fac. of Comput. Sci., New Brunswick Univ., Fredericton, NB, Canada
fYear
2004
Firstpage
305
Lastpage
314
Abstract
With the rapid growth of the Web, users easily get lost in the rich hyper structure. Providing the relevant information to users to cater to their needs is the primary goal of Website owners. Therefore, finding the content of the Web and retrieving the users´ interests and needs from their behavior have become increasingly important. Web mining is used to categorize users and pages by analyzing user behavior, the content of the pages, and the order of the URLs that tend to be accessed. Web structure mining plays an important role in this approach. Two page ranking algorithms, HITS and PageRank, are commonly used in Web structure mining. Both algorithms treat all links equally when distributing rank scores. Several algorithms have been developed to improve the performance of these methods. The weighted PageRank algorithm (WPR), an extension to the standard PageRank algorithm, is introduced. WPR takes into account the importance of both the inlinks and the outlinks of the pages and distributes rank scores based on the popularity of the pages. The results of our simulation studies show that WPR performs better than the conventional PageRank algorithm in terms of returning a larger number of relevant pages to a given query.
Keywords
Internet; Web sites; data mining; human factors; Web content mining; Web mining; Web page content; Web page ranking; Web structure mining; Web usage mining; Website owners; user behavior; weighted PageRank algorithm; Communication networks; Computer science; Content based retrieval; Electronic learning; Niobium; Pattern analysis; Topology; Uniform resource locators; Web mining; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Networks and Services Research, 2004. Proceedings. Second Annual Conference on
Print_ISBN
0-7695-2096-0
Type
conf
DOI
10.1109/DNSR.2004.1344743
Filename
1344743
Link To Document