DocumentCode :
1616114
Title :
Structure Based Prediction of Binding Residues on DNA-binding Proteins
Author :
Bhardwaj, Nitin ; Langlois, Robert E. ; Zhao, Guijun ; Lu, Hui
Author_Institution :
Dept. of Bioeng., Illinois Univ., Champaign, IL
fYear :
2005
fDate :
6/27/1905 12:00:00 AM
Firstpage :
2611
Lastpage :
2614
Abstract :
Annotation of the functional sites on the surface of a protein has been the subject of many studies. In this regard, the search for attributes and features characterizing these sites is of prime consequence. Here, we present an implementation of a kernel-based machine learning protocol for identifying residues on a DNA-binding protein form the interface with the DNA. Sequence and structural features including solvent accessibility, local composition, net charge and electrostatic potentials are examined. These features are then fed into support vector machines (SVM) to predict the DNA-binding residues on the surface of the protein. In order to compare with published work, we predict binding residues by training on other binding and non-binding residues in the same protein for which we achieved an accuracy of 79%. The sensitivity and specificity are 59% and 89%. We also consider a more realistic approach, predicting the binding residues of proteins entirely withheld from the training set achieving values of 66%, 43% and 81%, respectively. Performances reported here are better than other published results. Moreover, since our protocol does not lean on sequence or structural homology, it can be used to annotate unclassified proteins and more generally to identify novel binding sites with no similarity to the known cases
Keywords :
DNA; bioelectric potentials; biology computing; learning (artificial intelligence); molecular biophysics; molecular configurations; prediction theory; proteins; support vector machines; DNA-binding proteins; SVM; binding residues; electrostatic potentials; kernel-based machine learning; local composition; net charge; protein sequence; solvent accessibility; structure-based prediction; support vector machines; Access protocols; Bioinformatics; DNA; Electrostatics; Machine learning; Neural networks; Proteins; Sequences; Solvents; Support vector machines; SVMs; binding site prediction; function annotation; protein-DNA interaction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 2005. IEEE-EMBS 2005. 27th Annual International Conference of the
Conference_Location :
Shanghai
Print_ISBN :
0-7803-8741-4
Type :
conf
DOI :
10.1109/IEMBS.2005.1617004
Filename :
1617004
Link To Document :
بازگشت