Spam detection through link authorization from neighboring nodes

Author

Opoku-Mensah Eugene;Zhang Fengli;Opare Kwasi Adu-Boahen;Baagyere Edward Yellakuor

Author_Institution

School of Information and Software Engineering, UESTC, Chengdu, China

fYear

2015

fDate

9/1/2015 12:00:00 AM

Firstpage

1

Lastpage

6

Abstract

Current link spam techniques aim at manipulating both good and bad pages to boost their desired target page(s) and attract web surfers. The web structure of today includes links from bad to good pages and vice versa as well as pages of same kind. It is widely known that good pages seldom connect to bad ones, hence, spamming is assumed when such connections occur. Therefore, such good pages are penalized. However, such penalization tend to be unfair since every web page has an honest and dishonest part. Besides, several factors such as pages similarity influences the web hyperlinks distribution. Based on this, the paper proposes Link Authorization Model to detect link spam propagation onto neighboring pages. We design metrics with relevant link and content features to compute the angular similarity between connecting good-bad pages. Then based on the angular similarity, we are able to predict page-links as true or false authorization. Hence, for every false authorization detected, the out-going page receives a penalization by a pre-determined threshold. Our results show an average spamicity of 0.77 and a corresponding demotion of 0.60.

Keywords

"Authorization","Web pages","Search engines","Google","Joining processes","Electronic mail","Computational modeling"

Publisher

ieee

Conference_Titel

e-Technologies and Networks for Development (ICeND),2015 Forth International Conference on

Type

conf

DOI

10.1109/ICeND.2015.7328538

Filename

7328538