Title :
Improved pairwise alignment of proteins in the Twilight Zone using local structure predictions
Author :
Huang, Yao-Ming ; Bystroff, Christopher
Author_Institution :
Dept. of Biol., Rensselaer Polytech. Inst., Troy, NY, USA
Abstract :
Recent advances in the ability to discriminate between homologous and non-homologous proteins in the "Twilight Zone" of sequence similarity, must be accompanied by accurate alignments if they are to be of value to molecular modelers. Pairwise alignments require a measure of evolutionary distance, traditionally modeled using global amino acid substitution matrices. But real differences in the likelihood of substitutions may exist for different structural contexts within proteins, since structure contributes to the selective pressure. HMMSUM (HMMSTR-based SUbstitution Matrices) is a new model for structure-dependent amino acid substitution probabilities consisting of a set of 281 matrices, one for each of the sequence-structure contexts defined in HMMSTR (a Hidden Markov Model for protein STRucture). HMMSUM does not require the structure of the protein to be known, using HMMSTR predictions instead. Alignments using the HMMSUM compare favorably BLOSUM50 alignments when validated against curated remote homolog alignments from BAIiBASE.
Keywords :
biology computing; distance measurement; hidden Markov models; molecular biophysics; proteins; BAIiBASE; BLOSUM50 alignments; evolutionary distance; global amino acid substitution matrices; hidden Markov model; molecular modelers; pairwise alignment; proteins; remote homolog alignments; sequence-structure context; substitution matrices; twilight zone; Amino acids; Bioinformatics; Costs; Frequency estimation; Genetic mutations; Hidden Markov models; Matrices; Probability; Proteins; Sequences;
Conference_Titel :
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN :
0-7695-2442-7
DOI :
10.1109/CSBW.2005.75