• DocumentCode
    87695
  • Title

    Protein Chain Pair Simplification under the Discrete Fréchet Distance

  • Author

    Wylie, Tim ; Binhai Zhu

  • Author_Institution
    Dept. of Comput. Sci., Montana State Univ., Bozeman, MT, USA
  • Volume
    10
  • Issue
    6
  • fYear
    2013
  • fDate
    Nov.-Dec. 2013
  • Firstpage
    1372
  • Lastpage
    1383
  • Abstract
    For protein structure alignment and comparison, a lot of work has been done using RMSD as the distance measure, which has drawbacks under certain circumstances. Thus, the discrete Frόchet distance was recently applied to the problem of protein (backbone) structure alignment and comparison with promising results. For this problem, visualization is also important because protein chain backbones can have as many as 500-600 α-carbon atoms, which constitute the vertices in the comparison. Even with an excellent alignment, the similarity of two polygonal chains can be difficult to visualize unless the chains are nearly identical. Thus, the chain pair simplification problem (CPS-3F) was proposed in 2008 to simultaneously simplify both chains with respect to each other under the discrete Frochet distance. The complexity of CPS-3F is unknown, so heuristic methods have been developed. Here, we define a variation of CPS-3F, called the constrained CPS-3F problem (CPS-3Fþ), and prove that it is polynomially solvable by presenting a dynamic programming solution, which we then prove is a factor-2 approximation for CPS-3F. We then compare the CPS-3Fþ solutions with previous empirical results, and further demonstrate some of the benefits of the simplified comparisons. Chain pair simplification based on the Hausdorff distance (CPS-2H) is known to be NP-complete, and here we prove that the constrained version (CPS-2H+) is also NP-complete. Finally, we discuss future work and implications along with a software library implementation, named the Frochet-based Protein Alignment & Comparison Toolkit (FPACT).
  • Keywords
    bioinformatics; dynamic programming; molecular configurations; proteins; proteomics; FPACT; Frochet-based protein alignment and comparison toolkit; Hausdorff distance; RMSD; alpha-carbon atoms; constrained CPS-3F problem; discrete Frechet distance; dynamic programming solution; factor-two approximation; heuristic methods; polygonal chains; protein chain backbones; protein chain pair simplification; protein structure alignment; software library implementation; Approximation algorithms; Approximation methods; Bioinformatics; Dynamic programming; Proteins; Visualization; Approximation algorithms; Discrete Fréchet distance; NP-complete; Protein structure alignment; dynamic programming; protein structure simplification and visualization;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.17
  • Filename
    6477035