• DocumentCode
    1798474
  • Title

    A novel semi-supervised learning for SMS classification

  • Author

    Ahmed, Ishtiaq ; Donghai Guan ; Teachoong Chung

  • Author_Institution
    Dept. of Comput. Eng., Kyung Hee Univ., Seoul, South Korea
  • Volume
    2
  • fYear
    2014
  • fDate
    13-16 July 2014
  • Firstpage
    856
  • Lastpage
    861
  • Abstract
    In this paper, we propose a novel semi-supervised methodology to detect spam or ham SMSs, using frequent item set mining algorithm Apriori, probabilistic model Naive Bayes and ensemble learning. This paper considers the unbalanced data set problem which means designing of two class SMS classifier using small number of ham and unlabeled dataset only. Using only a few labeled examples with Semi-supervised training is typically unreliable. However, by applying user-specified minimum support and minimum confidence on ham and unlabeled dataset, we gained significant accuracy on classifying SMSs, experimenting on UCI data Repository.
  • Keywords
    data mining; electronic messaging; learning (artificial intelligence); pattern classification; Apriori algorithm; SMS classification; ensemble learning; frequent item set mining algorithm; naive Bayes model; semisupervised learning; Abstracts; Accuracy; Classification algorithms; Information security; Support vector machines; Apriori Algorithm; Ensemble Learning; Ham; Minimum confidence; Minimum support; Naive Bayes classifier; Short Message Service (SMS); spam;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2014 International Conference on
  • Conference_Location
    Lanzhou
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4799-4216-9
  • Type

    conf

  • DOI
    10.1109/ICMLC.2014.7009721
  • Filename
    7009721