Title :
Divide-and-Conquer Algorithm for Clustalw-MPI
Author :
Rezaei, Siamak ; Monwar, Md Maruf
Author_Institution :
Comput. Sci., Univ. of Northern British Columbia, BC
Abstract :
Multiple sequence alignment continues to be an active field of research in computational biology and the most widely used tool for multiple sequence alignment is ClustalW, which achieves alignment via three steps: pair wise alignment, guide tree generation and progressive alignment. ClustalW-MPI is a parallel implementation of ClustalW. In this paper, a new approach, divide-and-conquer, is implemented which uses ClustalW-MPI for sequence alignment but it gets a better speed up performance than ClustalW-MPI. In this approach, the sequences are first cut down into smaller subsequences by divide-and-conquer technique to minimize the computational space. Then these subsequences are sent to different available processors using message passing interface technique. Those processors align the subsequences by executing ClustalW-MPI simultaneously. After aligning, the results are then sent to the main processor to be concatenated to produce the final alignment. But some quality of the alignment may be compromised in this approach for the introduction of gaps at the start or end of subsequences aligned. Therefore, some heuristic methods for fixing the cut points were suggested for future improvement, such as overlapping alignment and sliding window alignment
Keywords :
application program interfaces; biology computing; divide and conquer methods; genetics; message passing; sequences; trees (mathematics); ClustalW-MPI; computational biology; divide-and-conquer algorithm; guide tree generation; message passing interface technique; multiple sequence alignment; pair wise alignment; progressive alignment; Bioinformatics; Computational biology; Computer science; Concatenated codes; Dynamic programming; Genomics; Heuristic algorithms; Message passing; Proteins; Sequences; Divide-and-Conquer algorithm; message passing interface; multiple sequence alignment; multiple tree alignment;
Conference_Titel :
Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
Conference_Location :
Ottawa, Ont.
Print_ISBN :
1-4244-0038-4
Electronic_ISBN :
1-4244-0038-4
DOI :
10.1109/CCECE.2006.277630