Title :
Spam host classification using PSO-SVM
Author :
Enache, Adriana-Cristina ; Sgarciu, Valentin
Author_Institution :
Fac. of Autom. Control & Comput. Sci., Politeh. Univ., Bucharest, Romania
Abstract :
Search engines have become a de facto place to start information acquisition on the Internet. Sabotaging the quality of the results retrieved by search engines can lead users to doubt the search engine provider. Spam websites can serve as means of phishing. This paper shows a spam host detection approach that uses support vector machines(SVM) for classification. We create a parallel version of standard Particle Swarm Optimization(PSO) to determine free parameters of the SVM classifier and apply our proposed model to a content web spamming dataset, WEBSPAM-UK2011. Our implementation of the parallel PSO is constructed on a pool of threads and each thread executes tasks associated to a particle from the swarm. Experiments showed that our proposed model can achieve a higher accuracy than regular SVM and outperforms other classifiers (C4.5, Naive Bayes). Furthermore, parallel version of standard Particle Swam Optimization(PSO) can efficiently select parameters for SVM.
Keywords :
Internet; Web sites; parallel algorithms; particle swarm optimisation; search engines; security of data; support vector machines; unsolicited e-mail; Internet; PSO; SVM; Web spamming dataset; particle swarm optimization; phishing; search engines; spam Websites; spam host classification; spam host detection; support vector machines; Accuracy; Kernel; Particle swarm optimization; Sensitivity; Standards; Support vector machines; Unsolicited electronic mail; Particle Swarm Optimization; Support Vector Machine; parallelism; spam host;
Conference_Titel :
Automation, Quality and Testing, Robotics, 2014 IEEE International Conference on
Conference_Location :
Cluj-Napoca
Print_ISBN :
978-1-4799-3731-8
DOI :
10.1109/AQTR.2014.6857840