• DocumentCode
    1804765
  • Title

    IdentifyingWeb Spam by Densely Connected Sites and its Statistics in a JapaneseWeb Snapshot

  • Author

    Ono, Hiroshi ; Toyoda, Masashi ; Kitsuregawa, Masaru

  • Author_Institution
    The University of Tokyo, Japan
  • fYear
    2006
  • fDate
    2006
  • Abstract
    Web spamming refers to actions intended to mislead search engines into ranking certain pages higher than they deserve. Recently, the amount of web spam has increased dramatically, leading to a degradation of search results. One of the most effective spamming techniques is link spamming. This is done by setting up an interconnected structure of pages for deceiving link-based ranking methods, such as PageRank. In this paper, we analyze distributions of link spam in our archive of Japanese web pages using link analysis techniques.
  • Keywords
    Data engineering; Data mining; Degradation; Information analysis; Optimization methods; Search engines; Statistics; Toy industry; Unsolicited electronic mail; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshops, 2006. Proceedings. 22nd International Conference on
  • Conference_Location
    Atlanta, GA, USA
  • Print_ISBN
    0-7695-2571-7
  • Type

    conf

  • DOI
    10.1109/ICDEW.2006.64
  • Filename
    1623926