Title :
Effective Feature Selection on Data with Uncertain Labels
Author :
Wang, Bo ; Jia, Yan ; Han, Yi ; Han, Weihong
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol.
fDate :
March 29 2009-April 2 2009
Abstract :
Nowadays, various learning technologies are required on uncertain data. As an important pre-processing step in data mining, feature selection needs to consider this vagueness or uncertainty. In this paper, we propose a novel algorithm to evaluate the correlation between features and uncertain class labels on the basis of Hilbert-Schmidt Independence Criterion. Consequently, the features can be ranked according to this criterion. Experimental results on extensive datasets demonstrate the benefits of our method.
Keywords :
Hilbert spaces; data mining; feature extraction; learning (artificial intelligence); uncertainty handling; Hilbert-Schmidt independence criterion; data mining; feature selection; learning technology; uncertain label; Cardiac disease; Data engineering; Data mining; Face recognition; Filters; Hypertension; Loss measurement; Monitoring; Sampling methods; Uncertainty; feature selection; uncertain data;
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
DOI :
10.1109/ICDE.2009.170