DocumentCode :
1798474
Title :
A novel semi-supervised learning for SMS classification
Author :
Ahmed, Ishtiaq ; Donghai Guan ; Teachoong Chung
Author_Institution :
Dept. of Comput. Eng., Kyung Hee Univ., Seoul, South Korea
Volume :
2
fYear :
2014
fDate :
13-16 July 2014
Firstpage :
856
Lastpage :
861
Abstract :
In this paper, we propose a novel semi-supervised methodology to detect spam or ham SMSs, using frequent item set mining algorithm Apriori, probabilistic model Naive Bayes and ensemble learning. This paper considers the unbalanced data set problem which means designing of two class SMS classifier using small number of ham and unlabeled dataset only. Using only a few labeled examples with Semi-supervised training is typically unreliable. However, by applying user-specified minimum support and minimum confidence on ham and unlabeled dataset, we gained significant accuracy on classifying SMSs, experimenting on UCI data Repository.
Keywords :
data mining; electronic messaging; learning (artificial intelligence); pattern classification; Apriori algorithm; SMS classification; ensemble learning; frequent item set mining algorithm; naive Bayes model; semisupervised learning; Abstracts; Accuracy; Classification algorithms; Information security; Support vector machines; Apriori Algorithm; Ensemble Learning; Ham; Minimum confidence; Minimum support; Naive Bayes classifier; Short Message Service (SMS); spam;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
Conference_Location :
Lanzhou
ISSN :
2160-133X
Print_ISBN :
978-1-4799-4216-9
Type :
conf
DOI :
10.1109/ICMLC.2014.7009721
Filename :
7009721
Link To Document :
بازگشت