DocumentCode
2541790
Title
Prediction of Protein B-Factor Profile Based on Feature Selection and Kernel Learning
Author
Pan, Xiao-Yong ; Shen, Hong-Bin
Author_Institution
Inst. of Image Process. & Pattern Recognition, Shanghai Jiaotong Univ., Shanghai, China
fYear
2009
fDate
4-6 Nov. 2009
Firstpage
1
Lastpage
5
Abstract
B-factor reflects the atom´s uncertainty about its average position within a crystal structure and is highly correlated with protein functions. In this article, we propose a novel approach to predict the real value of B-factor. We firstly extract features from the protein sequences and their evolution information, then apply random forest tree to select the important features, which are further inputted to a two-stage support vector regression (SVR) for prediction. Our results have revealed that a systematic analysis of the importance of different features makes us have deep insights into the different contributions of features and is very necessary for developing effective B-factor prediction tools. We thus develop an online Web server, which is freely available at http://www.csbio.situ.edu.cn/bioinf/PredBF for academic use.
Keywords
bioinformatics; feature extraction; learning (artificial intelligence); molecular biophysics; proteins; regression analysis; support vector machines; trees (mathematics); SVR; atom uncertainty; crystal structure; feature extraction; feature selection; kernel learning; online Web server; protein B-factor profile function prediction tool; protein sequence evolution information; random forest tree; systematic analysis; two-stage support vector regression; Amino acids; Data mining; Feature extraction; Image processing; Kernel; Pattern recognition; Proteins; Regression tree analysis; Uncertainty; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-4199-0
Type
conf
DOI
10.1109/CCPR.2009.5344037
Filename
5344037
Link To Document