Title :
Attribute relevance in multiclass data sets using the naive Bayes rule
Author :
Sotoca, J.M. ; Sánchez, J.S. ; Pla, F.
Author_Institution :
Dept. Llenguatges i Sistemes Inf., Univ. Jaume I, Castellon, Spain
Abstract :
Feature selection using the naive Bayes rule is presented for the case of multiclass data sets. In this paper, the EM algorithm is applied to each class projected over the features in order to obtain an estimation of the class probability density function. A matrix of weights per class and feature is then obtained, where it collects the level of relevance of each feature for the different classes. We show different ways to extract this information and compare the behavior of the ranking of relevance obtained applying the naive Bayes and K-NN classifiers.
Keywords :
Bayes methods; estimation theory; feature extraction; matrix algebra; optimisation; pattern classification; probability; EM algorithm; K-nearest neighbour classifiers; feature extraction; feature selection; matrix algebra; multiclass data sets; naive Bayes rule; probability density function estimation; Data mining; Pattern recognition; Probability density function; Programmable logic arrays;
Conference_Titel :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334557