DocumentCode :
1757875
Title :
A Rough Hypercuboid Approach for Feature Selection in Approximation Spaces
Author :
Maji, Pradipta
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Volume :
26
Issue :
1
fYear :
2014
fDate :
Jan. 2014
Firstpage :
16
Lastpage :
29
Abstract :
The selection of relevant and significant features is an important problem particularly for data sets with large number of features. In this regard, a new feature selection algorithm is presented based on a rough hypercuboid approach. It selects a set of features from a data set by maximizing the relevance, dependency, and significance of the selected features. By introducing the concept of the hypercuboid equivalence partition matrix, a novel representation of degree of dependency of sample categories on features is proposed to measure the relevance, dependency, and significance of features in approximation spaces. The equivalence partition matrix also offers an efficient way to calculate many more quantitative measures to describe the inexactness of approximate classification. Several quantitative indices are introduced based on the rough hypercuboid approach for evaluating the performance of the proposed method. The superiority of the proposed method over other feature selection methods, in terms of computational complexity and classification accuracy, is established extensively on various real-life data sets of different sizes and dimensions.
Keywords :
computational complexity; data mining; matrix algebra; pattern classification; rough set theory; approximate classification; approximation spaces; classification accuracy; computational complexity; data mining; feature dependency; feature relevance; feature selection; feature significance; hypercuboid equivalence partition matrix; quantitative indices; quantitative measures; rough hypercuboid approach; Approximation methods; Data analysis; Data mining; Redundancy; Rough sets; Uncertainty; Pattern recognition; data mining; feature selection; rough hypercuboid approach; rough sets;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2012.242
Filename :
6381414
Link To Document :
بازگشت