DocumentCode :
2829497
Title :
Novel IPCA-Based Classifiers and Their Application to Spam Filtering
Author :
Rozza, Alessandro ; Lombardi, Gabriele ; Casiraghi, Elena
Author_Institution :
Dipt. di Inf. e Comun., Univ. degli Studi di Milano, Milan, Italy
fYear :
2009
fDate :
Nov. 30 2009-Dec. 2 2009
Firstpage :
797
Lastpage :
802
Abstract :
This paper proposes a novel two-class classifier, called IPCAC, based on the isotropic principal component analysis technique; it allows to deal with training data drawn from mixture of Gaussian distributions, by projecting the data on the Fisher subspace that separates the two classes. The obtained results demonstrate that IPCAC is a promising technique; furthermore, to cope with training datasets being dynamically supplied, and to work with non-linearly separable classes, two improvements of this classifier are defined: a model merging algorithm, and a kernel version of IPCAC. The effectiveness of the proposed methods is shown by their application to the spam classification problem, and by the comparison of the achieved results with those obtained by support vector machines SVM, and K-nearest neighbors KNN.
Keywords :
Gaussian distribution; merging; pattern classification; principal component analysis; unsolicited e-mail; Fisher subspace; Gaussian distributions; isotropic principal component analysis-based classifiers; kernel version; model merging algorithm; spam filtering; training data; Classification algorithms; Clustering algorithms; Covariance matrix; Gaussian distribution; Kernel; Management training; Principal component analysis; Support vector machine classification; Support vector machines; Unsolicited electronic mail; Classification; Isotropic PCA; Kernel methods; Model-Merging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-1-4244-4735-0
Electronic_ISBN :
978-0-7695-3872-3
Type :
conf
DOI :
10.1109/ISDA.2009.21
Filename :
5364038
Link To Document :
بازگشت