Title :
A heuristic algorithm for blocked multiple sequence alignment
Author :
Jiang, Tao ; Zhao, Peng
Author_Institution :
Dept. of Comput. Sci., California Univ., Riverside, CA, USA
Abstract :
Blocked multiple-sequence alignment (BMA) refers to the construction of multiple alignments in DNA by first aligning conserved regions into what we call “blocks” and then aligning the regions between successive blocks to form a final alignment. Instead of starting from low-order pairwise alignments, we propose a new way to form blocks by searching for closely related regions in all input sequences, allowing internal spaces in blocks as well as some degree of mismatch. We address the problem of semi-conserved patterns (patterns that do not appear in all input sequences) by introducing into the process two similarity thresholds that are adjusted dynamically according to the input. A method to control the number of blocks is also presented to deal with the situation when input sequences have so many similar regions that it becomes impractical to form blocks by trying every combination. BMA is an implementation of this approach, and our experimental results indicate that this approach is efficient, particularly on large numbers of long sequences with well-conserved regions
Keywords :
DNA; biology computing; heuristic programming; molecular biophysics; molecular orientation; pattern matching; physics computing; sequences; BMA algorithm; DNA; block number control; blocked multiple sequence alignment; closely related regions; conserved regions; dynamically adjusted similarity thresholds; heuristic algorithm; input sequences; internal spaces; long sequences; mismatch; semi-conserved patterns; similar regions; Computational biology; Computer science; DNA; Heuristic algorithms; Proteins; Sequences; Tin;
Conference_Titel :
Bio-Informatics and Biomedical Engineering, 2000. Proceedings. IEEE International Symposium on
Conference_Location :
Arlington, VA
Print_ISBN :
0-7695-0862-6
DOI :
10.1109/BIBE.2000.889605