• DocumentCode
    242949
  • Title

    Stability of feature selection algorithms and its influence on prediction accuracy in biomedical datasets

  • Author

    Drotar, Peter ; Smekal, Zdenek

  • Author_Institution
    Dept. of Telecommun., Brno Univ. of Technol., Brno, Czech Republic
  • fYear
    2014
  • fDate
    22-25 Oct. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Feature selection techniques become significant part of many bioinformatics and biomedical applications. Choosing the important features is essential for biomarker discovery, provide better understanding of the data and potentially improve prediction performance. However, as the number of samples in dataset is small, the feature selection tends to be unstable. In this paper, the stability of five popular feature selection techniques is compared and analyzed when original dataset is perturbed by adding, removing or simply resampling the original observations. Next, the feature selection techniques are used as filter prior to AdaBoost classifier and their influence on classification accuracy and Mathews correlation coefficient is compared.
  • Keywords
    bioinformatics; feature selection; Mathews correlation coefficient; bioinformatics applications; biomarker discovery; biomedical applications; biomedical datasets; feature selection algorithm stability; filter prior-to-AdaBoost classifier; Accuracy; Bioinformatics; Diseases; Power system stability; Redundancy; Stability criteria; Adaboost; Dunne stability index; bioinformatics; feature selection; stability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2014 - 2014 IEEE Region 10 Conference
  • Conference_Location
    Bangkok
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-4076-9
  • Type

    conf

  • DOI
    10.1109/TENCON.2014.7022309
  • Filename
    7022309