• DocumentCode
    2909214
  • Title

    Multi-objective spam filtering using an evolutionary algorithm

  • Author

    Dudley, James ; Barone, Luigi ; While, Lyndon

  • Author_Institution
    Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA
  • fYear
    2008
  • fDate
    1-6 June 2008
  • Firstpage
    123
  • Lastpage
    130
  • Abstract
    SpamAssassin is a widely-used open source heuristic-based spam filter that applies a large number of weighted tests to a message, sums the results of the tests, and labels the message as spam if the sum exceeds a user-defined threshold. Due to the large number of tests and the interactions between them, defining good weights for SpamAssassin is difficult: moreover, users with different needs may desire different sets of weights to be used. We have built a multi-objective evolutionary algorithm MOSF that evolves weights for the tests in SpamAssassin according to two independent objectives: minimising the number of false positives (legitimate messages mislabeled as spam), and minimising the number of false negatives (spam messages mislabeled as legitimate). We show that MOSF returns a set of solutions offering a range of setups for SpamAssassin satisfying different users' needs, and also that MOSF can derive solutions which beat the existing SpamAssassin weights in both objectives simultaneously. Applying these ideas could substantially increase the usefulness of SpamAssassin and similar systems.
  • Keywords
    evolutionary computation; security of data; unsolicited e-mail; MOSF; SpamAssassin; multi-objective evolutionary algorithm; multi-objective spam filtering; spam messages; user-defined threshold; widely-used open source heuristic; Costs; Evolutionary computation; Filtering; Filters; Internet; Productivity; Switches; Testing; Unsolicited electronic mail; Wikipedia;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-1822-0
  • Electronic_ISBN
    978-1-4244-1823-7
  • Type

    conf

  • DOI
    10.1109/CEC.2008.4630786
  • Filename
    4630786
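
  • Illustration
    The abstract describes a scoring rule (sum the weights of the tests that fire, label as spam if the sum exceeds a threshold) and two objectives (false positives and false negatives). The sketch below illustrates those two ideas only; it is not the authors' MOSF implementation. The function names, the test names "TEST_A" and "TEST_B", the weights, and the tiny corpus are all hypothetical, and the dominance check is the standard Pareto definition for minimisation, assumed here rather than taken from the paper.

```python
from typing import Dict, List, Set, Tuple

# A message is modelled as the set of test names that fired on it (assumption).
Message = Set[str]


def is_spam(fired: Message, weights: Dict[str, float], threshold: float) -> bool:
    """Sum the weights of the fired tests; spam if the sum exceeds the threshold."""
    score = sum(weights.get(name, 0.0) for name in fired)
    return score > threshold


def evaluate(
    weights: Dict[str, float],
    threshold: float,
    corpus: List[Tuple[Message, bool]],
) -> Tuple[int, int]:
    """Return (false_positives, false_negatives) for one weight set.

    Each corpus entry is (fired_tests, truly_spam).
    """
    false_pos = 0
    false_neg = 0
    for fired, truly_spam in corpus:
        labelled_spam = is_spam(fired, weights, threshold)
        if labelled_spam and not truly_spam:
            false_pos += 1  # legitimate message mislabelled as spam
        elif not labelled_spam and truly_spam:
            false_neg += 1  # spam message mislabelled as legitimate
    return false_pos, false_neg


def dominates(a: Tuple[int, int], b: Tuple[int, int]) -> bool:
    """Pareto dominance for minimisation: a is no worse everywhere, better somewhere."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


# Hypothetical data, for illustration only.
example_weights = {"TEST_A": 3.0, "TEST_B": 2.5}
example_corpus = [
    ({"TEST_A", "TEST_B"}, True),   # spam, scored 5.5
    ({"TEST_A"}, False),            # legitimate, scored 3.0
    ({"TEST_B"}, True),             # spam, scored 2.5
    ({"TEST_A", "TEST_B"}, False),  # legitimate, scored 5.5
]
example_objectives = evaluate(example_weights, 5.0, example_corpus)
```

    A weight set whose (false positives, false negatives) pair dominates that of the default weights would correspond to the abstract's claim of beating the existing weights "in both objectives simultaneously"; weight sets that do not dominate one another form the range of trade-offs offered to users.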