• DocumentCode
    1592591
  • Title
    Weighted PageRank algorithm
  • Author
    Xing, Wenpu; Ghorbani, Ali
  • Author_Institution
    Fac. of Comput. Sci., New Brunswick Univ., Fredericton, NB, Canada
  • fYear
    2004
  • Firstpage
    305
  • Lastpage
    314
  • Abstract
    With the rapid growth of the Web, users easily get lost in the rich hyper structure. Providing the relevant information to users to cater to their needs is the primary goal of Website owners. Therefore, finding the content of the Web and retrieving the users' interests and needs from their behavior have become increasingly important. Web mining is used to categorize users and pages by analyzing user behavior, the content of the pages, and the order of the URLs that tend to be accessed. Web structure mining plays an important role in this approach. Two page ranking algorithms, HITS and PageRank, are commonly used in Web structure mining. Both algorithms treat all links equally when distributing rank scores. Several algorithms have been developed to improve the performance of these methods. The weighted PageRank algorithm (WPR), an extension to the standard PageRank algorithm, is introduced. WPR takes into account the importance of both the inlinks and the outlinks of the pages and distributes rank scores based on the popularity of the pages. The results of our simulation studies show that WPR performs better than the conventional PageRank algorithm in terms of returning a larger number of relevant pages to a given query.
  • Keywords
    Internet; Web sites; data mining; human factors; Web content mining; Web mining; Web page content; Web page ranking; Web structure mining; Web usage mining; Website owners; user behavior; weighted PageRank algorithm; Communication networks; Computer science; Content based retrieval; Electronic learning; Niobium; Pattern analysis; Topology; Uniform resource locators; Web pages
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communication Networks and Services Research, 2004. Proceedings. Second Annual Conference on
  • Print_ISBN
    0-7695-2096-0
  • Type
    conf
  • DOI
    10.1109/DNSR.2004.1344743
  • Filename
    1344743