DocumentCode :
2582802
Title :
Approximate global alignment of sequences
Author :
Kahveci, Tamer ; Ramaswamy, Venkatakrishnan ; Tao, Han ; Li, Tao
Author_Institution :
Dept. of Computer & Information Science & Engineering, University of Florida, FL, USA
fYear :
2005
fDate :
19-21 Oct. 2005
Firstpage :
81
Lastpage :
88
Abstract :
We propose two novel dynamic programming (DP) methods that solve the approximate bounded and unbounded global alignment problems for biological sequences. Our first method solves the bounded alignment problem. It computes the distribution of the edit distance between the remaining suffixes. Given a bound k and an approximation level p%, it uses this distribution to prune those entries of the DP matrix that, with probability greater than p%, lead to alignments with more than k edit operations. Our second method addresses the unbounded global alignment problem. For each entry of the distance matrix, it dynamically computes an upper bound on the distance between the unaligned suffixes. This bound, together with the lower bound computed as in the bounded case, is then used to eliminate entries of the distance matrix. According to our experimental results, our methods are up to three times faster than the competing methods for bounded alignment and up to two times faster for unbounded alignment, even with 100% approximation. Our methods use only 17-68% of the space used by the next best competitor.
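The pruning idea behind the bounded case can be illustrated with a minimal sketch. This is not the authors' method: instead of the probabilistic distribution of suffix distances described in the abstract, it assumes a much simpler, exact lower bound on the distance between the remaining suffixes (the difference of their lengths). The function name `bounded_edit_distance` and its parameters are hypothetical and chosen only for illustration.

```python
from typing import Optional


def bounded_edit_distance(a: str, b: str, k: int) -> Optional[int]:
    """Return the unit-cost edit distance of a and b if it is <= k, else None.

    Assumption (not from the paper): the lower bound on the distance between
    the unaligned suffixes is simply the difference of their lengths.
    """
    n, m = len(a), len(b)
    inf = k + 1  # any value above k marks a pruned entry
    if abs(n - m) > k:
        return None

    # Row 0: aligning the empty prefix of a with b[:j] costs j insertions.
    prev = [j if j <= k else inf for j in range(m + 1)]

    for i in range(1, n + 1):
        cur = [inf] * (m + 1)
        # Entries with |i - j| > k already need more than k operations.
        lo, hi = max(0, i - k), min(m, i + k)
        for j in range(lo, hi + 1):
            if j == 0:
                cost = prev[0] + 1  # delete a[i-1]
            else:
                cost = min(
                    prev[j] + 1,                           # deletion
                    cur[j - 1] + 1,                        # insertion
                    prev[j - 1] + (a[i - 1] != b[j - 1]),  # match or substitution
                )
            # Lower bound on the cost of aligning the remaining suffixes.
            remaining = abs((n - i) - (m - j))
            # Prune the entry if it cannot lead to an alignment within k.
            cur[j] = cost if cost + remaining <= k else inf
        prev = cur

    return prev[m] if prev[m] <= k else None
```

For example, `bounded_edit_distance("kitten", "sitting", 3)` returns 3, while the same call with `k=2` returns `None`. Replacing the length-difference bound with a probabilistic estimate, as the abstract describes, would prune more entries at the cost of exactness.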
Keywords :
biology computing; dynamic programming; molecular biophysics; molecular configurations; approximate global sequence alignment; biological sequences; bounded alignment problem; dynamic programming; unbounded global alignment problem; Approximation algorithms; Assembly; Bioinformatics; Biology computing; Distributed computing; Dynamic programming; Frequency; Information science; Phylogeny; Upper bound;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
Type :
conf
DOI :
10.1109/BIBE.2005.13
Filename :
1544452