DocumentCode
1798474
Title
A novel semi-supervised learning for SMS classification
Author
Ahmed, Ishtiaq ; Donghai Guan ; Teachoong Chung
Author_Institution
Dept. of Comput. Eng., Kyung Hee Univ., Seoul, South Korea
Volume
2
fYear
2014
fDate
13-16 July 2014
Firstpage
856
Lastpage
861
Abstract
In this paper, we propose a novel semi-supervised methodology to detect spam or ham SMSs, using frequent item set mining algorithm Apriori, probabilistic model Naive Bayes and ensemble learning. This paper considers the unbalanced data set problem which means designing of two class SMS classifier using small number of ham and unlabeled dataset only. Using only a few labeled examples with Semi-supervised training is typically unreliable. However, by applying user-specified minimum support and minimum confidence on ham and unlabeled dataset, we gained significant accuracy on classifying SMSs, experimenting on UCI data Repository.
Keywords
data mining; electronic messaging; learning (artificial intelligence); pattern classification; Apriori algorithm; SMS classification; ensemble learning; frequent item set mining algorithm; naive Bayes model; semisupervised learning; Abstracts; Accuracy; Classification algorithms; Information security; Support vector machines; Apriori Algorithm; Ensemble Learning; Ham; Minimum confidence; Minimum support; Naive Bayes classifier; Short Message Service (SMS); spam;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
Conference_Location
Lanzhou
ISSN
2160-133X
Print_ISBN
978-1-4799-4216-9
Type
conf
DOI
10.1109/ICMLC.2014.7009721
Filename
7009721
Link To Document