Title :
Novel IPCA-Based Classifiers and Their Application to Spam Filtering
Author :
Rozza, Alessandro ; Lombardi, Gabriele ; Casiraghi, Elena
Author_Institution :
Dipt. di Inf. e Comun., Univ. degli Studi di Milano, Milan, Italy
fDate :
Nov. 30 2009-Dec. 2 2009
Abstract :
This paper proposes a novel two-class classifier, called IPCAC, based on the isotropic principal component analysis technique; it allows to deal with training data drawn from mixture of Gaussian distributions, by projecting the data on the Fisher subspace that separates the two classes. The obtained results demonstrate that IPCAC is a promising technique; furthermore, to cope with training datasets being dynamically supplied, and to work with non-linearly separable classes, two improvements of this classifier are defined: a model merging algorithm, and a kernel version of IPCAC. The effectiveness of the proposed methods is shown by their application to the spam classification problem, and by the comparison of the achieved results with those obtained by support vector machines SVM, and K-nearest neighbors KNN.
Keywords :
Gaussian distribution; merging; pattern classification; principal component analysis; unsolicited e-mail; Fisher subspace; Gaussian distributions; isotropic principal component analysis-based classifiers; kernel version; model merging algorithm; spam filtering; training data; Classification algorithms; Clustering algorithms; Covariance matrix; Gaussian distribution; Kernel; Management training; Principal component analysis; Support vector machine classification; Support vector machines; Unsolicited electronic mail; Classification; Isotropic PCA; Kernel methods; Model-Merging;
Conference_Titel :
Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-1-4244-4735-0
Electronic_ISBN :
978-0-7695-3872-3
DOI :
10.1109/ISDA.2009.21