DocumentCode :
2023521
Title :
Novel Application of Query-Based Qualitative Predictors for Characterization of Solvent Accessible Residues in Conjunction with Protein Sequence Homology
Author :
Rose, Daniel A. ; Nepal, Reecha ; Mishra, Radhika ; Lau, Robert ; Gholizadeh, Shabnam ; Lustig, Brooke
Author_Institution :
Dept. of Chem., San Jose State Univ., San Jose, CA, USA
fYear :
2011
fDate :
Aug. 29 2011-Sept. 2 2011
Firstpage :
70
Lastpage :
74
Abstract :
Prediction of relative solvent accessibility (RSA) is a standard first-approach in predicting three-dimensional protein structures. Here we have applied linear regression methods that include various sequence homology values for each residue as well as query residue qualitative predictors, corresponding to each of the twenty canonical amino acids. We fit the 268-protein learning set with a variety of sequence homology terms, including 20 and 6-term sequence entropy, and residue qualitative predictors. Then estimated RSA values are subsequently generated for the 215-protein Manesh test set. The qualitative predictors describe the actual query residue type (e.g. Gly) as opposed to the measures of sequence homology for the aligned subject sequences. This is consistent with our framework of modeling a limited set of discrete and/or physically intuitive predictors. Initial calculations involving normalized RSA values were considered as a likely first attempt, incorporating the notion of fitting an explicit binary characterization of individual residues, either as buried or accessible. Interestingly, the utilization of qualitative predictors showed significant prediction accuracy. Subsequent calculations using the original RSA values gave estimated values that, upon binary classification, indicated accuracies comparable to other first stage methods. Development of a second stage methodology is of current interest.
Keywords :
biology computing; molecular biophysics; query processing; regression analysis; 3D protein structure; Manesh test set; binary characterization; linear regression method; protein sequence homology; query residue qualitative predictor; query-based qualitative predictor; sequence entropy; sequence homology term; solvent accessible residue characterization; Accuracy; Amino acids; Entropy; Protein sequence; Solvents; Support vector machines; buried residues; hydrophobicity; qualitative predictors; sequence entropy; surface accessibilities;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2011 22nd International Workshop on
Conference_Location :
Toulouse
ISSN :
1529-4188
Print_ISBN :
978-1-4577-0982-1
Type :
conf
DOI :
10.1109/DEXA.2011.57
Filename :
6059796
Link To Document :
بازگشت