DocumentCode :
3461907
Title :
An Enhanced Genetic Programming Approach for Detecting Unsolicited Emails
Author :
Trivedi, S.K. ; Dey, Shuvashis
Author_Institution :
Inf. Syst., Indian Inst. of Manage., Indore, India
fYear :
2013
fDate :
3-5 Dec. 2013
Firstpage :
1153
Lastpage :
1160
Abstract :
Identification of unsolicited emails (spams) is now a well-recognized research area within text classification. A good email classifier is not only evaluated by performance accuracy but also by the false positive rate. This research presents an Enhanced Genetic Programming (EGP) approach which works by building an ensemble of classifiers for detecting spams. The proposed classifier is tested on the most informative features of two public ally available corpuses (Enron and Spam assassin) found using Greedy stepwise search method. Thereafter, the proposed ensemble of classifiers is compared with various Machine Learning Classifiers: Genetic Programming (GP), Bayesian, Naïve Bayes (NB), J48, Random forest (RF), and SVM. Results of this study indicate that the proposed classifier (EGP) is the best classifier among those compared in terms of performance accuracy as well as false positive rate.
Keywords :
genetic algorithms; greedy algorithms; pattern classification; search problems; text analysis; unsolicited e-mail; EGP approach; Enron; Spam assassin; classifier ensemble; email classifier; enhanced genetic programming; false positive rate; greedy stepwise search method; informative features; spam detection; text classification; unsolicited emails detection; unsolicited emails identification; Accuracy; Feature extraction; Genetic programming; Support vector machines; Training; Unsolicited electronic mail; Enhanced Genetic Programming; Ensemble; F-Value; False Positive Rate; GP; J48; Machine Learning Classifiers; Performance Accuracy; Probabilistic classifiers; Random Forest; SVM; Sensitivity; Unsolicited Emails;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
Type :
conf
DOI :
10.1109/CSE.2013.171
Filename :
6755352
Link To Document :
بازگشت