Title :
Feature subset selection approach based on fuzzy rough set for high-dimensional data
Author :
Changyou Guo ; Xuefeng Zheng
Author_Institution :
Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China
Abstract :
Feature subset selection, as an important processing step to knowledge discovery and machine learning, is effective method in reducing irrelevant and or redundant features, compressing repeated data, and improving classification accuracy. Rough set theory is an important tool to select feature subset from high-dimensional data. In this work, feature subset selection based on fuzzy rough set is introduced, and the efficient measure of feature significance is designed. Based on the fuzzy rough set model, a quick feature subset selection approach is presented, which can efficiently identify relevant features as well as redundancy among all features. In addition, the KNN-based classifier based on the proposed approach is constructed. The experimental results show that the proposed feature subset selection approach achieves better classification on UCI datasets.
Keywords :
data mining; fuzzy set theory; learning (artificial intelligence); rough set theory; KNN-based classifier; feature subset selection; fuzzy rough set; high-dimensional data; knowledge discovery; machine learning; Accuracy; Approximation methods; Data models; Feature extraction; Information systems; Rough sets; Feature subset selection; data mining; fuzzy rough set; high-dimensional data; rough set;
Conference_Titel :
Granular Computing (GrC), 2014 IEEE International Conference on
Conference_Location :
Noboribetsu
DOI :
10.1109/GRC.2014.6982810