• DocumentCode
    1697075
  • Title

    Stability of different feature selection methods for selecting protein sequence descriptors in protein solubility classification problem

  • Author

    Kocbek, Simon ; Stiglic, Gregor ; Pernek, Igor ; Kokol, Peter

  • Author_Institution
    Fac. of Health Sci., Univ. of Maribor, Maribor, Slovenia
  • fYear
    2010
  • Firstpage
    50
  • Lastpage
    55
  • Abstract
    Predicting protein solubility has gained lots of intention in the recent years and several descriptors have been defined to describe proteins in these works. Therefore, different feature selection methods have been used for selecting the most important attributes. An empirical study, that aims to explain the relationship between the number of samples and stability of seven different feature selection techniques for protein datasets, is presented.
  • Keywords
    biology computing; pattern classification; proteins; feature selection method stability; protein datasets; protein sequence descriptors selection; protein solubility classification problem; Amino acids; Correlation; Databases; Numerical stability; Proteins; Stability analysis; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer-Based Medical Systems (CBMS), 2010 IEEE 23rd International Symposium on
  • Conference_Location
    Perth, WA
  • ISSN
    1063-7125
  • Print_ISBN
    978-1-4244-9167-4
  • Type

    conf

  • DOI
    10.1109/CBMS.2010.6042613
  • Filename
    6042613