Title :
Discrimination of Insoluble-Carbohydrate Binding Proteins and Their Binding Sites Using a 3D Motif Detection Method
Author :
Doxey, Andrew C. ; Cheng, Zhenyu ; McConkey, Brendan J.
Author_Institution :
Dept. of Biol., Univ. of Waterloo, Waterloo, ON
Abstract :
We apply a 3D motif detection approach to insoluble-carbohydrate binding modules (CBMs), a class of proteins that share a common binding activity but lack a common sequence motif or fold. Key features of insoluble-carbohydrate binding sites were incorporated into a 3D motif detection algorithm, and used in a linear discriminant analysis. Our method effectively discriminated all known type A CBMs from a non-redundant structural dataset and correctly detected known binding sites without any false positives. The algorithm was used to screen a structural database of homology-modeled proteins from tobacco, and the results were experimentally validated using an affinity purification assay and mass spectrometric protein identification. The algorithm correctly predicted CBMs and binding sites not included in the training set, and predicted a previously unknown binding site in the PR-5d protein family. This work highlights the potential of 3D motif detection methods for use in large-scale functional annotation.
Keywords :
biology computing; learning (artificial intelligence); mass spectra; molecular biophysics; proteins; purification; sugar; 3D motif detection; affinity purification assay; binding activity; carbohydrate binding proteins; machine-learning approach; mass spectrometric protein identification; structural database; tobacco; Amino acids; Bioinformatics; Crystallization; Databases; Detection algorithms; Mass spectroscopy; Prediction algorithms; Proteins; Purification; Sequences; 3D motif; binding-site prediction; carbohydrate-binding proteins; cellulose; chitin;
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
DOI :
10.1109/BIBM.2008.74