DocumentCode
3204768
Title
A fast algorithm on average for all-against-all sequence matching
Author
Baeza-Yates, Ricardo A. ; Gonnet, Gaston H.
Author_Institution
Dept. de Ciencias de la Comput., Chile Univ., Santiago, Chile
fYear
1999
fDate
1999
Firstpage
16
Lastpage
23
Abstract
We present an algorithm which attempts to align pairs of subsequences from a database of genetic sequences. The algorithm simulates the classical dynamic programming alignment algorithm over a suffix array of the database. We provide a detailed average case analysis which shows that the running time of the algorithm is subquadratic with respect to the database size. A similar algorithm solves the approximate string matching problem in sublinear average time
Keywords
biology computing; computational complexity; database theory; dynamic programming; molecular biophysics; string matching; DNA; all-against-all sequence matching; approximate string matching problem; average case analysis; dynamic programming alignment algorithm; fast algorithm; genetic sequences; peptide sequence database; running time; suffix array; Algorithm design and analysis; Computational modeling; DNA; Databases; Dynamic programming; Genetics; Heuristic algorithms; Impedance matching; Peptides; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
String Processing and Information Retrieval Symposium, 1999 and International Workshop on Groupware
Conference_Location
Cancun
Print_ISBN
0-7695-0268-7
Type
conf
DOI
10.1109/SPIRE.1999.796573
Filename
796573
Link To Document