• DocumentCode
    2582802
  • Title

    Approximate global alignment of sequences

  • Author

    Kahveci, Tamer ; Ramaswamy, Venkatakrishnan ; Tao, Han ; Li, Tao

  • Author_Institution
    Dept. of Comput. & Inf. Sci. & Eng., Florida Univ., FL, USA
  • fYear
    2005
  • fDate
    19-21 Oct. 2005
  • Firstpage
    81
  • Lastpage
    88
  • Abstract
    We propose two novel dynamic programming (DP) methods that solve the approximate bounded and unbounded global alignment problems for biological sequences. Our first method solves the bounded alignment problem. It computes the distribution of the edit distance between the remaining suffixes. For a given bound k and approximation p%, it uses this distribution to prune the entries of the DP matrix that will lead to alignments with more than k edit operations with more than p% probability. Our second method addresses the unbounded global alignment problem. For each entry of the distance matrix, it dynamically computes an upper bound to the distance between the unaligned suffixes. This bound, along with the lower bound as computed for the bounded case, is then used to eliminate the entries of the distance matrix. According to our experimental results, our methods are up to three times faster than the competing methods for the bounded alignment and up to two times faster for the unbounded alignment, even with 100% approximation. Our methods use only 17-68% of the space used by the next best competitor.
  • Keywords
    biology computing; dynamic programming; molecular biophysics; molecular configurations; approximate global sequence alignment; biological sequences; bounded alignment problem; dynamic programming; unbounded global alignment problem; Approximation algorithms; Assembly; Bioinformatics; Biology computing; Distributed computing; Dynamic programming; Frequency; Information science; Phylogeny; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
  • Print_ISBN
    0-7695-2476-1
  • Type

    conf

  • DOI
    10.1109/BIBE.2005.13
  • Filename
    1544452