DocumentCode
3691921
Title
Spam detection through link authorization from neighboring nodes
Author
Opoku-Mensah Eugene;Zhang Fengli;Opare Kwasi Adu-Boahen;Baagyere Edward Yellakuor
Author_Institution
School of Information and Software Engineering, UESTC, Chengdu, China
fYear
2015
fDate
9/1/2015 12:00:00 AM
Firstpage
1
Lastpage
6
Abstract
Current link spam techniques manipulate both good and bad pages to boost their desired target page(s) and attract web surfers. Today's web structure includes links from bad pages to good pages and vice versa, as well as links between pages of the same kind. It is widely held that good pages seldom link to bad ones; hence, spamming is assumed when such connections occur, and the good pages involved are penalized. However, such penalization tends to be unfair, since every web page has an honest and a dishonest part. Besides, several factors, such as page similarity, influence the distribution of web hyperlinks. Based on this, the paper proposes a Link Authorization Model to detect link spam propagation onto neighboring pages. We design metrics with relevant link and content features to compute the angular similarity between connected good and bad pages. Based on this angular similarity, we predict whether each page link is a true or a false authorization. For every false authorization detected, the page with the outgoing link is penalized by a predetermined threshold. Our results show an average spamicity of 0.77 and a corresponding demotion of 0.60.
Keywords
"Authorization","Web pages","Search engines","Google","Joining processes","Electronic mail","Computational modeling"
Publisher
ieee
Conference_Titel
e-Technologies and Networks for Development (ICeND),2015 Forth International Conference on
Type
conf
DOI
10.1109/ICeND.2015.7328538
Filename
7328538