DocumentCode
3730431
Title
Content-based SMS spam filtering based on the Scaled Conjugate Gradient backpropagation algorithm
Author
Waddah Waheeb;Rozaida Ghazali;Mustafa Mat Deris
Author_Institution
Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Batu Pahat Johor, Parit Raja 86400, Malaysia
fYear
2015
Firstpage
675
Lastpage
680
Abstract
Content-based filtering is one of the most preferred methods to combat Short Message Service (SMS) spam. Memory usage and classification time are essential in SMS spam filtering, especially when working with limited resources. Therefore, suitable feature selection metric and proper filtering technique should be used. In this paper, we investigate how a learnt Artificial Neural Network with the Scaled Conjugate Gradient method (ANN-SCG) is suitable for content-based SMS spam filtering using a small size of features selected by Gini Index (GI) metric. The performance of ANN-SCG is evaluated in terms of true positive rate against false positive rate, Matthews Correlation Coefficient (MCC) and classification time. The evaluation results show the ability of ANN-SCG to filter SMS spam successfully with only one hundred features and a short classification time around to six microseconds. Thus, memory size and filtering time are reduced. An additional testing using unseen SMS messages is done to validate ANN-SCG with the one hundred features. The result again proves the efficiency of ANN-SCG with the one hundred features for SMS spam filtering with accuracy equal to 99.1%.
Keywords
"Measurement","Feature extraction","Training","Indexes","Artificial neural networks","Backpropagation algorithms","Correlation"
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type
conf
DOI
10.1109/FSKD.2015.7382023
Filename
7382023
Link To Document