• DocumentCode
    3336082
  • Title

    Novel approach: Naïve Bayes with Vector space model for spam classification

  • Author

    Vahora, Safvan ; Hasan, Mosin ; Lakhani, Reshma

  • Author_Institution
    Dept. of Inf. Technol., VGEC, Chandkheda, India
  • fYear
    2011
  • fDate
    8-10 Dec. 2011
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    We always see our normal mail goes into spam folder of the mail box. Interestingly 90% of the time the mail server classifies it perfectly but sometimes it fails due to spammer are getting highly technical. In this paper, we are using novel approach which uses Vector space model with Naïve Bayes to correctly classify mails as spam mail. Naïve Bayes method is used for spam classification but still binding with personalize word vector helps in increasing the accuracy of the system because user receives special type of message only. In this research work, we use vector space model with naïve bayes to classify spam mail. We got nearly 85% of accuracy in spam classification. We have used personalize mail classification option instead of standard global classification because people visiting subjective (i.e. pornographic) sites frequently get spam mail related to that subject (pornography) only and hence personalization shows improved result.
  • Keywords
    unsolicited e-mail; Naive Bayes; mail server; spam classification; vector space model; Postal services; Support vector machine classification; Text categorization; Training; Unsolicited electronic mail; Vectors; Classification; Naïve Bayes; Spam; Spammer; Vector space model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering (NUiCONE), 2011 Nirma University International Conference on
  • Conference_Location
    Ahmedabad, Gujarat
  • Print_ISBN
    978-1-4577-2169-4
  • Type

    conf

  • DOI
    10.1109/NUiConE.2011.6153245
  • Filename
    6153245