Title :
An efficient algorithm for local sequence alignment
Author :
Haque, Waqar ; Aravind, Alex ; Reddy, Bharath
Author_Institution :
Computer Science Program, University of Northern British Columbia, Canada, V2N 4Z9
Abstract :
DNA pairwise sequence alignment has been a subject of great interest in the past and still evokes large interest. Recent algorithms have either been slow and sensitive or fast and less sensitive. Here, we present a new algorithm which is fast and at the same time relatively sensitive. To increase the speed, we first build a suffix tree for both sequences and the alignment is triggered by the maximum matching substring. The algorithm employs mismatch seeds to increase both sensitivity and speed in the later stages. We tested our algorithm on randomly generated sequences of length up to 500 thousand and used Rosetta dataset to test the sensitivity of the algorithm.
Keywords :
Bioinformatics; Computational biology; Computer science; Concatenated codes; DNA; Databases; Dynamic programming; Heuristic algorithms; Sequences; Testing; Rosetta dataset; longest common substring; pairwise sequence alignment; suffix tree; Algorithms; Base Sequence; Molecular Sequence Data; Pattern Recognition, Automated; Sequence Alignment; Sequence Analysis, DNA; Software;
Conference_Titel :
Engineering in Medicine and Biology Society, 2008. EMBS 2008. 30th Annual International Conference of the IEEE
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-1814-5
Electronic_ISBN :
1557-170X
DOI :
10.1109/IEMBS.2008.4649419