DocumentCode
2774638
Title
Exploring both Content and Link Quality for Anti-Spamming
Author
Zhang, Lei ; Zhang, Yi ; Zhang, Yan ; Li, Xiaoming
Author_Institution
Peking University, China
fYear
2006
fDate
Sept. 2006
Firstpage
37
Lastpage
37
Abstract
Search engines play an increasingly important role in discovering information on the web. Spam web pages, however, employ various tricks to bamboozle search engines and thereby achieve undeserved ranks. In this paper, we propose a new page importance metric that takes both content quality and link quality into consideration. Based on this metric, we can assess the trust scores of all web pages using the web link graph. Experimental results on over 15 million web pages show that our method can effectively filter out spam and identify reputable sites.
Keywords
Computer science; Filling; Information filtering; Information filters; Information retrieval; Iterative algorithms; Laboratories; Search engines; Unsolicited electronic mail; Web pages
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Information Technology, 2006. CIT '06. The Sixth IEEE International Conference on
Conference_Location
Seoul
Print_ISBN
0-7695-2687-X
Type
conf
DOI
10.1109/CIT.2006.90
Filename
4019859