DocumentCode
3432726
Title
Multilayer perceptrons neural network based Web spam detection application
Author
Kwang Leng Goh ; Singh, A.K. ; King Hann Lim
Author_Institution
Sarawak Campus, Dept. of Electr. & Comput. Eng., Curtin Univ., Miri, Malaysia
fYear
2013
fDate
6-10 July 2013
Firstpage
636
Lastpage
640
Abstract
Web spam detection is a crucial task due to its devastation towards Web search engines and global cost of billion dollars annually. For these reasons, a multilayered perceptrons (MLP) neural network is presented in this paper to improve the Web spam detection accuracy. MLP neural network is used for Web spam classification due to its flexible structure and non-linearity transformation to accommodate latest Web spam patterns. An intensive investigation is carried out to obtain an optimal number of hidden neurons. Both Web spam link-based and content-based features are fed into MLP network for classification. Two benchmarking datasets - WEBSPAM-UK2006 and WEBSPAM-UK2007 are used to evaluate the performance of the proposed classifier. The overall performance is compared with the state of the art support vector machine (SVM) which is widely used to combat Web spam. The experiments have shown that MLP network outperforms SVM up to 14.02% on former dataset and up to 3.53% on later dataset.
Keywords
Internet; multilayer perceptrons; pattern classification; unsolicited e-mail; MLP neural network; SVM; Web search engines; Web spam classification; Web spam detection; Web spam link-based features; content-based features; multilayer perceptron neural network; nonlinearity transformation; support vector machine; Biological neural networks; Classification algorithms; Neurons; Supervised learning; Support vector machines; Unsolicited electronic mail; Neural Network; Scaled Conjugate Gradient; Support Vector Machines; Web Spam;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Information Processing (ChinaSIP), 2013 IEEE China Summit & International Conference on
Conference_Location
Beijing
Type
conf
DOI
10.1109/ChinaSIP.2013.6625419
Filename
6625419
Link To Document