DocumentCode
1811975
Title
Designing a web spam classifier based on feature fusion in the Layered Multi-population Genetic Programming framework
Author
Keyhanipour, Amir Hosein ; Moshiri, Behzad
Author_Institution
Center of Excellence, Univ. of Tehran, Tehran, Iran
fYear
2013
fDate
9-12 July 2013
Firstpage
53
Lastpage
60
Abstract
Nowadays, Web spam pages are a critical challenge for Web retrieval systems which have drastic influence on the performance of such systems. Although these systems try to combat the impact of spam pages on their final results list, spammers increasingly use more sophisticated techniques to increase the number of views for their intended pages in order to have more commercial success. This paper employs the recently proposed Layered Multi-population Genetic Programming model for Web spam detection task as well application of correlation coefficient analysis for feature space reduction. Based on our tentative results, the designed classifier, which is based on a combination of easy to compute features, has a very reasonable performance in comparison with similar methods.
Keywords
Internet; Web sites; correlation methods; feature extraction; genetic algorithms; information retrieval; pattern classification; unsolicited e-mail; Web retrieval systems; Web spam classifier; Web spam detection task; Web spam pages; correlation coefficient analysis; feature fusion; feature space reduction; layered multipopulation genetic programming framework; layered multipopulation genetic programming model; Correlation coefficient; Feature extraction; Genetic programming; Sociology; Unsolicited electronic mail; Web pages; Classifier; Layered Multi-Population Genetic Programming; Spam; Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Fusion (FUSION), 2013 16th International Conference on
Conference_Location
Istanbul
Print_ISBN
978-605-86311-1-3
Type
conf
Filename
6641335
Link To Document