DocumentCode :
3584668
Title :
An Experimental Study for Assessing Email Classification Attributes Using Feature Selection Methods
Author :
Qabajeh, Issa ; Thabtah, Fadi
Author_Institution :
Comput. & Inf., De Montfort Univ., Leicester, UK
fYear :
2014
Firstpage :
125
Lastpage :
132
Abstract :
Email phishing classification is one of the vital problems in the online security research domain that have attracted several scholars due to its impact on the users payments performed daily online. One aspect to reach a good performance by the detection algorithms in the email phishing problem is to identify the minimal set of features that significantly have an impact on raising the phishing detection rate. This paper investigate three known feature selection methods named Information Gain (IG), Chi-square and Correlation Features Set (CFS) on the email phishing problem to separate high influential features from low influential ones in phishing detection. We measure the degree of influentially by applying four data mining algorithms on a large set of features. We compare the accuracy of these algorithms on the complete features set before feature selection has been applied and after feature selection has been applied. After conducting experiments, the results show 12 common significant features have been chosen among the considered features by the feature selection methods. Further, the average detection accuracy derived by the data mining algorithms on the reduced 12-features set was very slight affected when compared with the one derived from the 47-features set.
Keywords :
data mining; pattern classification; security of data; unsolicited e-mail; CFS; chi-square; correlation features set; data mining algorithm; e-mail classification attributes; e-mail phishing classification; electronic mail; feature selection method; influentially degree; information gain; online security research domain; phishing detection rate; user payment; Accuracy; Classification algorithms; Correlation; Data mining; Electronic mail; Feature extraction; Training data; Data mining; Email Classification; Online Security; Phishing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computer Science Applications and Technologies (ACSAT), 2014 3rd International Conference on
Type :
conf
DOI :
10.1109/ACSAT.2014.29
Filename :
7076881
Link To Document :
بازگشت