DocumentCode :
2735216
Title :
Optimal nonlinear scoring function for global fitness landscape of protein design
Author :
Hu, Changyu ; Li, Xiang ; Liang, Jie
Author_Institution :
Dept. of Bioeng., Illinois Univ., Chicago, IL, USA
Volume :
2
fYear :
2004
fDate :
1-5 Sept. 2004
Firstpage :
2828
Lastpage :
2831
Abstract :
Protein design aims to identify sequences compatible with a given protein fold but incompatible to any alternative folds. To select the correct sequences and to guide the search process, a design scoring function is critically important. It is also important that a design scoring function can characterize the global fitness landscape of many proteins simultaneously. We describe how finding optimal design scoring functions can be understood from two geometric viewpoints, and propose a formulation using mixture of Gaussian kernel functions. We give results of distinguishing native sequences for a major portion of representative protein structures from a large number of alternative decoy sequences. We succeeded in deriving nonlinear scoring function that perfectly discriminate a set of 440 representative native proteins of known protein structures from 14 million sequence decoys. We show that no linear scoring function can have perfect discrimination. In an independent blind test using 194 unrelated proteins, our scoring function misclassifies only 13 native proteins. This compares favorably with 37 or 51 misclassifications when optimal linear functions reported in literature are used.
Keywords :
biology computing; macromolecules; molecular biophysics; molecular configurations; proteins; Gaussian kernel functions; decoy sequences; global fitness landscape; independent blind test; optimal nonlinear scoring function; protein design; protein sequences; protein structures; Amino acids; Biomedical engineering; Kernel; Performance evaluation; Process design; Protein engineering; Protein sequence; Testing; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-8439-3
Type :
conf
DOI :
10.1109/IEMBS.2004.1403807
Filename :
1403807
Link To Document :
بازگشت