Title :
Spam host classification using swarm intelligence
Author :
Enache, Adriana-Cristina ; Patriciu, Victor Valeriu
Author_Institution :
Fac. of Autom. Control & Comput. Sci., Politeh. Univ., Bucharest, Romania
Abstract :
Web Spam, or Spamdexing, is a form of Search Engine Optimization(SEO) spamming that hinders the efficiency of search engines. These types of exploits use unethical methods in order to place a web page into the first rank. Sabotaging the quality of the results retrieved by search engines can lead users to mistrust the search engine provider. Moreover, spam websites can be a starting point for phishing or malware attacks. Over the last decade Web Spamming has become an important problem. This paper shows a spam host detection approach that uses swarm intelligence. We test our model on two datasets (WEBSPAM-UK2011 and WEBSPAM-UK2007) and show that it can obtain a good accuracy. Moreover, we compared our approach with other popular classifiers (C4.5, SVM and Logistic Regression ) and empirically demonstrated that it can outperform them in some cases.
Keywords :
Internet; optimisation; search engines; swarm intelligence; unsolicited e-mail; WEBSPAM-UK2007; WEBSPAM-UK2011; Web spamming; search engine optimization spamming; spam host classification; spam host detection; spamdexing; swarm intelligence; Accuracy; Data mining; Feature extraction; Particle swarm optimization; Support vector machines; Training; Unsolicited electronic mail; ant colony classification algorithm; host spam; swarm intelligence;
Conference_Titel :
Communications (COMM), 2014 10th International Conference on
Conference_Location :
Bucharest
DOI :
10.1109/ICComm.2014.6866669