Title :
Fast semi-local alignment for DNA sequence database search
Author :
Chen, Yong-Sheng ; Hung, Yi-Ping ; Fuh, Chiou-Shann
Author_Institution :
Dept. of Med. Res. & Educ., Taipei Veterans Gen. Hosp., Taiwan
Abstract :
Given a query DNA sequence, our goal is to find all segments in a DNA sequence database that are similar to the query. We present a string-to-signal transform technique that converts a DNA sequence into a four-channel signal. Without considering gaps, the edit distance between two DNA sequences can be calculated as the sum of absolute differences (SAD) between their corresponding four-channel signals. The proposed algorithm can then be applied to speed up the search for sequence segments that yield small SADs. In addition to being efficient, the algorithm guarantees an optimal search: every sequence segment that is sufficiently similar to the query is found, with no misses.
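The relation between the gap-free distance and the SAD can be illustrated with a minimal sketch. The abstract does not specify the transform, so the one-hot encoding below (one indicator channel per base, in the order A, C, G, T) is an assumption, as are the function names `to_signal`, `sad`, and `search`. Under this assumed encoding, each mismatched position contributes exactly 2 to the SAD, so the mismatch count is SAD / 2. The `search` function is a plain exhaustive baseline that finds every qualifying segment; it is not the fast algorithm the paper proposes, whose details are not given in the abstract.

```python
# Assumed encoding (not taken from the paper): one indicator channel per base.
CHANNELS = "ACGT"


def to_signal(seq):
    """Map a DNA string to a list of four-channel samples (one-hot per base)."""
    seq = seq.upper()
    for base in seq:
        if base not in CHANNELS:
            raise ValueError(f"unsupported base: {base!r}")
    return [[1 if base == ch else 0 for ch in CHANNELS] for base in seq]


def sad(sig_a, sig_b):
    """Sum of absolute differences between two equal-length four-channel signals."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signals must have the same length")
    return sum(
        abs(x - y)
        for sample_a, sample_b in zip(sig_a, sig_b)
        for x, y in zip(sample_a, sample_b)
    )


def search(query, database, max_mismatches):
    """Exhaustive gap-free baseline: return (offset, mismatches) for every
    database window whose mismatch count is within the threshold."""
    q = to_signal(query)
    d = to_signal(database)
    hits = []
    for start in range(len(d) - len(q) + 1):
        # With one-hot channels, each mismatch adds 2 to the SAD.
        mismatches = sad(q, d[start:start + len(q)]) // 2
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits
```

For example, `search("ACGT", "TTACGTACCT", 1)` reports the exact match at offset 2 and the one-mismatch segment at offset 6. Because every window is examined, no qualifying segment is missed; a faster method would need to preserve that property while skipping windows that cannot qualify.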
Keywords :
DNA; biology computing; database management systems; query processing; scientific information systems; string matching; DNA sequence database search; edit distance; fast semi-local alignment; four-channel signal; optimal search; string-to-signal transform technique; sum of absolute difference; Biomedical engineering; Computer science; Computer science education; DNA; Databases; Dynamic programming; Hospitals; Information analysis; Information science; Sequences;
Conference_Title :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-1695-X
DOI :
10.1109/ICPR.2002.1048211