• DocumentCode
    678557
  • Title

    An artificial immune system with local feature selection classifier for spam filtering

  • Author

    Kalbhor, Mayank ; Shrivastava, S. ; Ujjainiya, Babita

  • Author_Institution
    Dept. of Inf. Technol., SATI, Vidisha, India
  • fYear
    2013
  • fDate
    4-6 July 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    The Local Concentration based feature extraction approach is take into consideration to be able to very effectively extract position related information from messages by transforming every area of a message to a corresponding LC feature. To include the LC approach into the entire process of spam filtering, a LC model is designed, where two kinds of detector sets are initially generated by using term selection strategies and a well-defined tendency threshold, then a window is applied to divide the message into local areas. After segmentation of the particular message, concentration of the detectors are calculated and brought as the feature for every local area. Finally, feature vector is created by combining all the local feature area. Then appropriate classification method inspired from immune system is applied on available feature vector. To check the performance of model, several experiments are conducted on four benchmark corpora using the cross-validation methodology. It is shown that our model performs well with the Information Gain as term selection methods, LC based feature extraction method with flexible applicability in the real world. In comparison of other global-concentration based feature extraction techniques like bag-of-word the LC approach has better performance in terms of both accuracy and measure. It is also demonstrated that the LC approach with artificial immune system inspired classifier gives better results against all parameters.
  • Keywords
    artificial immune systems; feature extraction; feature selection; pattern classification; unsolicited e-mail; LC based feature extraction method; LC model; appropriate classification method; artificial immune system inspired classifier; benchmark corpora; cross validation methodology; detector sets; feature vector; global concentration based feature extraction; information gain; local concentration based feature extraction; local feature area; local feature selection classifier; spam filtering; tendency threshold; term selection methods; term selection strategies; Feature extraction; Filtering; Immune system; Training; Unsolicited electronic mail; Vectors; AIS; information gain; local concentration; spam filtering; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing, Communications and Networking Technologies (ICCCNT),2013 Fourth International Conference on
  • Conference_Location
    Tiruchengode
  • Print_ISBN
    978-1-4799-3925-1
  • Type

    conf

  • DOI
    10.1109/ICCCNT.2013.6726691
  • Filename
    6726691