DocumentCode :
1991159
Title :
Characterizing and Predicting Catalytic Residues in Enzyme Active Sites Based on Local Properties: A Machine Learning Approach
Author :
Bobadilla, Leonardo ; Nino, Fernando ; Cepeda, Edilberto ; Patarroyo, M.A.
Author_Institution :
Nat. Univ. of Colombia, Bogota
fYear :
2007
fDate :
14-17 Oct. 2007
Firstpage :
938
Lastpage :
945
Abstract :
Developing computational methods for assigning protein function from tertiary structure is a very important problem, predicting a catalytic mechanism based only on structural information being a particularly challenging task. This work focuses on helping to understand the molecular basis of catalysis by exploring the nature of catalytic residues, their environment and characteristic properties in a large data set of enzyme structures and using this information to predict enzyme structures´ active sites. A machine learning approach that performs feature extraction, clustering and classification on a protein structure data set is proposed. The 6,376 residues directly involved in enzyme catalysis, present in more than 800 proteins structures in the PDB were analyzed. Feature extraction provided a description of critical features for each catalytic residue, which were consistent with prior knowledge about them. Results from k-fold-cross-validation for classification showed more than 80% accuracy. Complete enzymes were scanned using these classifiers to locate catalytic residues.
Keywords :
biochemistry; biology computing; catalysis; enzymes; feature extraction; genetics; learning (artificial intelligence); molecular biophysics; pattern classification; pattern clustering; catalytic residue prediction; enzyme active site; enzyme structure pridiction; feature extraction; functional genomics; k-fold-cross-validation; machine learning; pattern classification; pattern clustering; protein functional sites; protein structure data set; structural genomics; tertiary structure; Biochemistry; Bioinformatics; Crystallography; Data mining; Genomics; Machine learning; Nuclear magnetic resonance; Predictive models; Protein engineering; Sequences; Catalytic Residues; Functional Genomics; Machine Learning; Protein functional sites; Structural Genomics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
Type :
conf
DOI :
10.1109/BIBE.2007.4375671
Filename :
4375671
Link To Document :
بازگشت