• DocumentCode
    182810
  • Title

    Spam host classification using PSO-SVM

  • Author

    Enache, Adriana-Cristina ; Sgarciu, Valentin

  • Author_Institution
    Fac. of Autom. Control & Comput. Sci., Politeh. Univ., Bucharest, Romania
  • fYear
    2014
  • fDate
    22-24 May 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Search engines have become a de facto place to start information acquisition on the Internet. Sabotaging the quality of the results retrieved by search engines can lead users to doubt the search engine provider. Spam websites can serve as means of phishing. This paper shows a spam host detection approach that uses support vector machines(SVM) for classification. We create a parallel version of standard Particle Swarm Optimization(PSO) to determine free parameters of the SVM classifier and apply our proposed model to a content web spamming dataset, WEBSPAM-UK2011. Our implementation of the parallel PSO is constructed on a pool of threads and each thread executes tasks associated to a particle from the swarm. Experiments showed that our proposed model can achieve a higher accuracy than regular SVM and outperforms other classifiers (C4.5, Naive Bayes). Furthermore, parallel version of standard Particle Swam Optimization(PSO) can efficiently select parameters for SVM.
  • Keywords
    Internet; Web sites; parallel algorithms; particle swarm optimisation; search engines; security of data; support vector machines; unsolicited e-mail; Internet; PSO; SVM; Web spamming dataset; particle swarm optimization; phishing; search engines; spam Websites; spam host classification; spam host detection; support vector machines; Accuracy; Kernel; Particle swarm optimization; Sensitivity; Standards; Support vector machines; Unsolicited electronic mail; Particle Swarm Optimization; Support Vector Machine; parallelism; spam host;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automation, Quality and Testing, Robotics, 2014 IEEE International Conference on
  • Conference_Location
    Cluj-Napoca
  • Print_ISBN
    978-1-4799-3731-8
  • Type

    conf

  • DOI
    10.1109/AQTR.2014.6857840
  • Filename
    6857840