DocumentCode
2909214
Title
Multi-objective spam filtering using an evolutionary algorithm
Author
Dudley, James ; Barone, Luigi ; While, Lyndon
Author_Institution
Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA
fYear
2008
fDate
1-6 June 2008
Firstpage
123
Lastpage
130
Abstract
SpamAssassin is a widely-used open source heuristic-based spam filter that applies a large number of weighted tests to a message, sums the results of the tests, and labels the message as spam if the sum exceeds a user-defined threshold. Due to the large number of tests and the interactions between them, defining good weights for SpamAssassin is difficult: moreover, users with different needs may desire different sets of weights to be used. We have built a multi-objective evolutionary algorithm MOSF that evolves weights for the tests in SpamAssassin according to two independent objectives: minimising the number of false positives (legitimate messages mislabeled as spam), and minimising the number of false negatives (spam messages mislabeled as legitimate). We show that MOSF returns a set of solutions offering a range of setups for SpamAssassin satisfying different userspsila needs, and also that MOSF can derive solutions which beat the existing SpamAssassin weights in both objectives simultaneously. Applying these ideas could substantially increase the usefulness of SpamAssassin and similar systems.
Keywords
evolutionary computation; security of data; unsolicited e-mail; MOSF; SpamAssassin; multi-objective evolutionary algorithm; multi-objective spam filtering; spam messages; user-defined threshold; widely-used open source heuristic; Costs; Evolutionary computation; Filtering; Filters; Internet; Productivity; Switches; Testing; Unsolicited electronic mail; Wikipedia;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-1822-0
Electronic_ISBN
978-1-4244-1823-7
Type
conf
DOI
10.1109/CEC.2008.4630786
Filename
4630786
Link To Document