Title :
Learning with only the relevant features
Author :
Lashkia, George V.
Author_Institution :
Dept. of Inf. & Comput. Eng, Okayama Univ., Japan
Abstract :
In real-world concept-learning problems, the representation of data often uses many features, only a few of which may be related to the target concept. In this situation, the selection of relevant features becomes important for achieving high performance of a learning algorithm. A feature selection method based on tests has been introduced and its effectiveness in achieving high recognition rates has been shown (a test is a set of features that is sufficient to distinguish patterns from different classes of training samples). We show that the notion of a test is directly related to the notion of relevant attributes. We prove that prime tests represent sets of irredundant relevant features, and emphasize the importance of the test notion. We describe a way of selecting all relevant features from training examples, and possible ways to identify relevant features of the target concept. The empirical results show the effectiveness of the proposed method on real databases
Keywords :
feature extraction; learning by example; pattern classification; testing; concept learning; data representation; databases; feature selection method; irredundant relevant features; learning algorithm performance; pattern classification; prime tests; recognition rates; relevant attributes; training examples; Boolean functions; Feature extraction; Filters; Induction generators; Machine learning; Spatial databases; Testing;
Conference_Titel :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
Print_ISBN :
0-7803-7087-2
DOI :
10.1109/ICSMC.2001.969828