• DocumentCode
    1186041
  • Title

    TRIAL: A Tool for Finding Distant Structural Similarities

  • Author

    Venkateswaran, Jayendra ; Bin Song ; Kahveci, Tamer ; Jermaine, Christopher

  • Author_Institution
    Univ. of Florida, Gainesville, FL, USA
  • Volume
    8
  • Issue
    3
  • fYear
    2011
  • Firstpage
    819
  • Lastpage
    831
  • Abstract
    Finding structural similarities in distantly related proteins can reveal functional relationships that can not be identified using sequence comparison. Given two proteins A and B and threshold ϵ Å, we develop an algorithm, TRiplet-based Iterative ALignment (TRIAL) for computing the transformation of B that maximizes the number of aligned residues such that the root mean square deviation (RMSD) of the alignment is at most ϵ Å. Our algorithm is designed with the specific goal of effectively handling proteins with low similarity in primary structure, where existing algorithms perform particularly poorly. Experiments show that our method outperforms existing methods. TRIAL alignment brings the secondary structures of distantly related proteins to similar orientations. It also finds larger number of secondary structure matches at lower RMSD values and increased overall alignment lengths. Its classification accuracy is up to 63 percent better than other methods, including CE and DALI. TRIAL successfully aligns 83 percent of the residues from the smaller protein in reasonable time while other methods align only 29 to 65 percent of the residues for the same set of proteins.
  • Keywords
    bioinformatics; molecular biophysics; molecular configurations; proteins; CE; DALI; RMSD; TRIAL; aligned residues; distantly related proteins; primary structure; root mean square deviation; secondary structure; structural similarities; triplet-based iterative alignment algorithm; Algorithm design and analysis; Amino acids; Bioinformatics; Computational biology; Estimation theory; Iterative algorithms; Protein engineering; Research initiatives; Root mean square; Sequences; Protein structure; alignment.; tertiary structure; Algorithms; Analysis of Variance; Computational Biology; Databases, Protein; Models, Chemical; Models, Molecular; Protein Structure, Tertiary; Proteins; Reproducibility of Results; Sequence Alignment;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2009.28
  • Filename
    4798152