DocumentCode
242949
Title
Stability of feature selection algorithms and its influence on prediction accuracy in biomedical datasets
Author
Drotar, Peter ; Smekal, Zdenek
Author_Institution
Dept. of Telecommun., Brno Univ. of Technol., Brno, Czech Republic
fYear
2014
fDate
22-25 Oct. 2014
Firstpage
1
Lastpage
5
Abstract
Feature selection techniques become significant part of many bioinformatics and biomedical applications. Choosing the important features is essential for biomarker discovery, provide better understanding of the data and potentially improve prediction performance. However, as the number of samples in dataset is small, the feature selection tends to be unstable. In this paper, the stability of five popular feature selection techniques is compared and analyzed when original dataset is perturbed by adding, removing or simply resampling the original observations. Next, the feature selection techniques are used as filter prior to AdaBoost classifier and their influence on classification accuracy and Mathews correlation coefficient is compared.
Keywords
bioinformatics; feature selection; Mathews correlation coefficient; bioinformatics applications; biomarker discovery; biomedical applications; biomedical datasets; feature selection algorithm stability; filter prior-to-AdaBoost classifier; Accuracy; Bioinformatics; Diseases; Power system stability; Redundancy; Stability criteria; Adaboost; Dunne stability index; bioinformatics; feature selection; stability;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2014 - 2014 IEEE Region 10 Conference
Conference_Location
Bangkok
ISSN
2159-3442
Print_ISBN
978-1-4799-4076-9
Type
conf
DOI
10.1109/TENCON.2014.7022309
Filename
7022309
Link To Document