• DocumentCode
    2774638
  • Title

    Exploring both Content and Link Quality for Anti-Spamming

  • Author

    Zhang, Lei ; Zhang, Yi ; Zhang, Yan ; Li, Xiaoming

  • Author_Institution
    Peking University, China
  • fYear
    2006
  • fDate
    Sept. 2006
  • Firstpage
    37
  • Lastpage
    37
  • Abstract
    Search engines are playing a more and more important role in discovering information on the web nowadays. Spam web pages, however, are employing various tricks to bamboozle search engines, therefore achieving undeserved ranks. In this paper, we propose a new page importance metric, which takes both the content quality and the link quality into consideration. Based on this metric, we can judge the trust scores of all the web pages using the web link graph. Experimental results running on over 15 million web pages show that our method can filter out spam and identify reputable sites effectively.
  • Keywords
    Computer science; Filling; Information filtering; Information filters; Information retrieval; Iterative algorithms; Laboratories; Search engines; Unsolicited electronic mail; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology, 2006. CIT '06. The Sixth IEEE International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    0-7695-2687-X
  • Type

    conf

  • DOI
    10.1109/CIT.2006.90
  • Filename
    4019859