Title of article

Automatic Methods for Predicting Functionally Important Residues

Author/Authors

Antonio del Sol Mesa، نويسنده , , Florencio Pazos، نويسنده , , Alfonso Valencia، نويسنده ,

Issue Information

روزنامه با شماره پیاپی سال 2003

Pages

From page

1289

To page

1302

Abstract

Sequence analysis is often the first guide for the prediction of residues in a protein family that may have functional significance. A few methods have been proposed which use the division of protein families into subfamilies in the search for those positions that could have some functional significance for the whole family, but at the same time which exhibit the specificity of each subfamily (“Tree-determinant residues”). However, there are still many unsolved questions like the best division of a protein family into subfamilies, or the accurate detection of sequence variation patterns characteristic of different subfamilies. Here we present a systematic study in a significant number of protein families, testing the statistical meaning of the Tree-determinant residues predicted by three different methods that represent the range of available approaches. The first method takes as a starting point a phylogenetic representation of a protein family and, following the principle of Relative Entropy from Information Theory, automatically searches for the optimal division of the family into subfamilies. The second method looks for positions whose mutational behavior is reminiscent of the mutational behavior of the full-length proteins, by directly comparing the corresponding distance matrices. The third method is an automation of the analysis of distribution of sequences and amino acid positions in the corresponding multidimensional spaces using a vector-based principal component analysis. These three methods have been tested on two non-redundant lists of protein families: one composed by proteins that bind a variety of ligand groups, and the other composed by proteins with annotated functionally relevant sites. In most cases, the residues predicted by the three methods show a clear tendency to be close to bound ligands of biological relevance and to those amino acids described as participants in key aspects of protein function. These three automatic methods provide a wide range of possibilities for biologists to analyze their families of interest, in a similar way to the one presented here for the family of proteins related with ras-p21.

Keywords

Bioinformatics , protein structure , protein function , Tree-determinant position , functional residue

Journal title

Journal of Molecular Biology

Serial Year

2003

Journal title

Journal of Molecular Biology

Record number

1242449

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1242449