DocumentCode
1697075
Title
Stability of different feature selection methods for selecting protein sequence descriptors in protein solubility classification problem
Author
Kocbek, Simon ; Stiglic, Gregor ; Pernek, Igor ; Kokol, Peter
Author_Institution
Fac. of Health Sci., Univ. of Maribor, Maribor, Slovenia
fYear
2010
Firstpage
50
Lastpage
55
Abstract
Predicting protein solubility has gained lots of intention in the recent years and several descriptors have been defined to describe proteins in these works. Therefore, different feature selection methods have been used for selecting the most important attributes. An empirical study, that aims to explain the relationship between the number of samples and stability of seven different feature selection techniques for protein datasets, is presented.
Keywords
biology computing; pattern classification; proteins; feature selection method stability; protein datasets; protein sequence descriptors selection; protein solubility classification problem; Amino acids; Correlation; Databases; Numerical stability; Proteins; Stability analysis; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer-Based Medical Systems (CBMS), 2010 IEEE 23rd International Symposium on
Conference_Location
Perth, WA
ISSN
1063-7125
Print_ISBN
978-1-4244-9167-4
Type
conf
DOI
10.1109/CBMS.2010.6042613
Filename
6042613
Link To Document