• DocumentCode
    146555
  • Title

    Towards filtering spam mails using dimensionality reduction methods

  • Author

    Thomas, Julian ; Raj, Nisha S. ; Vinod, P.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., SCMS Sch. of Eng. & Technol., Ernakulam, India
  • fYear
    2014
  • fDate
    25-26 Sept. 2014
  • Firstpage
    163
  • Lastpage
    168
  • Abstract
    Numerous methods based on the content based filtering is available for email spam identification. Dimensionality of the feature space is recognized as one of the leading factors that affect the efficiency in classifying mails. This study identifies feature selection techniques used in the general text classification for spam filtering. Also, the classification and prediction is performed using different entities of email such as header, body and subject. We present a comparative study of different feature selection methods. Through extensive experiments we demonstrated that Weighted Mutual Information feature selection with header and body of the emails is efficient in email classification.
  • Keywords
    e-mail filters; feature selection; information filtering; pattern classification; text analysis; unsolicited e-mail; content based filtering; dimensionality reduction methods; email classification; email spam identification; feature selection method; feature selection techniques; feature space; filtering spam mails; leading factors; spam filtering; text classification; weighted mutual Information feature selection; Accuracy; Mutual information; Postal services; Support vector machines; Unsolicited electronic mail; Vectors; classifier; dimensionality reduction; feature selection; spam filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-4237-4
  • Type

    conf

  • DOI
    10.1109/CONFLUENCE.2014.6949337
  • Filename
    6949337