• DocumentCode
    3423422
  • Title

    Hyperlink Classification: A New Approach to Improve PageRank

  • Author

    Cun-He, Li ; Ke-Qiang, Lv

  • Author_Institution
    China Univ. of Pet., Dongying
  • fYear
    2007
  • fDate
    3-7 Sept. 2007
  • Firstpage
    274
  • Lastpage
    277
  • Abstract
    Hyperlink structure is widely used in the hypertext classification, but it has not been paid enough attention. We propose a hyperlink classification approach to improve PageRank algorithm which is widely used in the link analysis of search engine. The cause of the topic drift problem is analyzed and the hyperlinks are classified according to their creating motivations and effects. The improved PageRank algorithm is implemented on the open source search engine NUTCH in Chinese Internet. The experimental results show that the improved PageRank algorithm performs better than the standard PageRank.
  • Keywords
    Internet; pattern classification; public domain software; search engines; Chinese Internet; NUTCH; PageRank; hyperlink classification; hypertext classification; link analysis; open source search engine; topic drift problem; Algorithm design and analysis; Application software; Cause effect analysis; Data engineering; Databases; Expert systems; Internet; Petroleum; Robustness; Search engines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2007. DEXA '07. 18th International Workshop on
  • Conference_Location
    Regensburg
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-2932-5
  • Type

    conf

  • DOI
    10.1109/DEXA.2007.14
  • Filename
    4312900