DocumentCode
677896
Title
Two Extensions to Multi-label Correlation-Based Feature Selection: A Case Study in Bioinformatics
Author
Jungjit, Suwimol ; Freitas, Alex A. ; Michaelis, Martin ; Cinatl, J.
Author_Institution
Sch. of Comput., Univ. of Kent, Canterbury, UK
fYear
2013
fDate
13-16 Oct. 2013
Firstpage
1519
Lastpage
1524
Abstract
This paper proposes two extensions to a Multi-Label Correlation Based Feature Selection Method (ML-CFS): (1) ML-CFS using the absolute value of the correlation coefficient in the equation for evaluating a candidate feature subset, and (2) ML-CFS using Mutual Information for class label weighting. These extensions are evaluated in a bioinformatics case study addressing the multi-label classification of a cancer-related DNA micro array dataset with over 20,000 features. The results show that ML-CFS with absolute value of correlation obtained a significantly better predictive accuracy (smaller hamming loss) than the original ML-CFS. On the other hand, using Mutual Information to assign weights to labels showed some positive effect when using the ML-RBF classifier, but it showed a negative effect when using the ML-kNN classifier.
Keywords
bioinformatics; cancer; feature extraction; lab-on-a-chip; pattern classification; radial basis function networks; set theory; ML-CFS; ML-RBF classifier; ML-kNN classifier; bioinformatics; cancer-related DNA microarray dataset; candidate feature subset; class label weighting; correlation coefficient; multilabel classification; multilabel correlation based feature selection method; mutual information; predictive accuracy; Accuracy; Classification algorithms; Correlation; Correlation coefficient; DNA; Equations; Mutual information; microarray data; multi-label classification; multi-label feature selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
Conference_Location
Manchester
Type
conf
DOI
10.1109/SMC.2013.262
Filename
6722015
Link To Document