DocumentCode
1233815
Title
A Parallel Pairwise Local Sequence Alignment Algorithm
Author
Bandyopadhyay, Sanghamitra ; Mitra, Ramkrishna
Author_Institution
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Volume
8
Issue
2
fYear
2009
fDate
6/1/2009 12:00:00 AM
Firstpage
139
Lastpage
146
Abstract
Researchers are compelled to use heuristic-based pairwise sequence alignment tools instead of Smith-Waterman (SW) algorithm due to space and time constraints, thereby losing significant amount of sensitivity. Parallelization is a possible solution, though, till date, the parallelization is restricted to database searching through database fragmentation. In this paper, the power of a cluster computer is utilized for developing a parallel algorithm, RPAlign, involving, first, the detection of regions that are potentially alignable, followed by their actual alignment. RPAlign is found to reduce the timing requirement by a factor of upto 9 and 99 when used with the basic local alignment search tool (BLAST) and SW, respectively, while keeping the sensitivity similar to the corresponding method. For distantly related sequences, which remain undetected by BLAST, RPAlign with SW can be used. Again, for megabase-scale sequences, when SW becomes computationally intractable, the proposed method can still align them reasonably fast with high sensitivity.
Keywords
DNA; bioinformatics; genomics; heuristic programming; molecular biophysics; parallel algorithms; very large databases; BLAST comparison; RPAlign; SW comparison; Smith-Waterman algorithm; basic local alignment search tool; cluster computer; heuristic based pairwise sequence alignment; megabase scale sequences; parallel algorithm; parallel pairwise local sequence alignment algorithm; parallelisation; potentially alignable region detection; timing requirement; Basic local alignment search tool (BLAST); Smith–Waterman (SW); message passing interface (MPI); parallel computing; Algorithms; Sequence Alignment; Sequence Analysis; Sequence Homology;
fLanguage
English
Journal_Title
NanoBioscience, IEEE Transactions on
Publisher
ieee
ISSN
1536-1241
Type
jour
DOI
10.1109/TNB.2009.2019642
Filename
4813201
Link To Document