• DocumentCode
    1676333
  • Title

    Support Vector Machines and Random Forests Modeling for Spam Senders Behavior Analysis

  • Author

    Tang, Yuchun ; Krasser, Sven ; He, Yuanchen ; Yang, Weilai ; Alperovitch, Dmitri

  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Unwanted and malicious messages dominate email traffic and pose a great threat to the utility of email communications. Reputation systems have been getting momentum as the solution. Such systems extract email senders behavior data based on global sending distribution, analyze them and assign a value of trust to each IP address sending email messages. We build two models for the classification purpose. One is based on support vector machines (SVM) and the other is random forests(RF). Experimental results show that either classifier is effective. RF is slightly more accurate, but more expensive in terms of both time and space. SVM produces similar accuracy in a much faster manner if given modeling parameters. These classifiers can contribute to a reputation system as one source of analysis and increase its accuracy.
  • Keywords
    IP networks; support vector machines; telecommunication traffic; trees (mathematics); unsolicited e-mail; IP address; electronic mail traffic; global sending distribution; random forests modeling; spam senders behavior analysis; support vector machines; Data mining; Electronic mail; Feedback; Filtering; Helium; Radio frequency; Support vector machine classification; Support vector machines; Unsolicited electronic mail; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global Telecommunications Conference, 2008. IEEE GLOBECOM 2008. IEEE
  • Conference_Location
    New Orleans, LO
  • ISSN
    1930-529X
  • Print_ISBN
    978-1-4244-2324-8
  • Type

    conf

  • DOI
    10.1109/GLOCOM.2008.ECP.419
  • Filename
    4698194