Title :
OFFSS: optimal fuzzy-valued feature subset selection
Author :
Tsang, E.C.C. ; Yeung, D.S. ; Wang, X.Z.
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, China
fDate :
4/1/2003 12:00:00 AM
Abstract :
Feature subset selection is a well-known pattern recognition problem, which aims to reduce the number of features used in classification or recognition. This reduction is expected to improve the performance of classification algorithms in terms of speed, accuracy and simplicity. Most existing feature selection investigations focus on the case that the feature values are real or nominal, very little research is found to address the fuzzy-valued feature subset selection and its computational complexity. This paper focuses on a problem called optimal fuzzy-valued feature subset selection (OFFSS), in which the quality-measure of a subset of features is defined by both the overall overlapping degree between two classes of examples and the size of feature subset. The main contributions of this paper are that: 1) the concept of fuzzy extension matrix is introduced; 2) the computational complexity of OFFSS is proved to be NP-hard; 3) a simple but powerful heuristic algorithm for OFFSS is given; and 4) the feasibility and simplicity of the proposed algorithm are demonstrated by applications of OFFSS to fuzzy decision tree induction and by comparisons with three different feature selection techniques developed recently.
Keywords :
computational complexity; data mining; feature extraction; fuzzy set theory; learning (artificial intelligence); pattern classification; computational complexity; data mining; feature subset selection; learning; optimal fuzzy-valued feature; pattern classification; pattern recognition; Classification algorithms; Computational complexity; Data mining; Decision trees; Heuristic algorithms; Linear discriminant analysis; Machine learning; Neural networks; Pattern recognition; Principal component analysis;
Journal_Title :
Fuzzy Systems, IEEE Transactions on
DOI :
10.1109/TFUZZ.2003.809895