DocumentCode
3598669
Title
Good spaced seeds for homology search
Author
Choi, Kwok Pui ; Zeng, Fanfan ; Zhang, Louxin
Author_Institution
Dept. of Math., Nat. Univ. of Singapore, Singapore
fYear
2004
Firstpage
379
Lastpage
386
Abstract
Filtration is an important technique used to speed up local alignment as exemplified in the BLAST programs. Recently, Ma, Tromp and Li (2002) discovered that better filtering can be achieved by spacing out the matching positions according to a certain pattern, instead of contiguous positions to trigger a local alignment in their PatternHunter program. Such a match pattern is called a spaced seed. Our numerical computation shows that the ranks of spaced seeds (based on sensitivity) change with the sequences similarity. Since homologous sequences may have diverse similarity, we assess the sensitivity of spaced seeds over a range of similarity levels and present a list of good spaced seeds for facilitating homology search in DNA genomic sequences. We validate that the listed spaced seeds are indeed more sensitive using three arbitrarily chosen pairs of DNA genomic sequences.
Keywords
DNA; biology computing; database management systems; genetics; molecular biophysics; proteins; query processing; sequences; DNA genomic sequences; PatternHunter program; filtration; homologous sequences; homology; similarity; spaced seeds; Bioinformatics; Biological cells; DNA; Filtration; Genomics; Mathematics; Pattern matching; Probability; Sequences; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
Print_ISBN
0-7695-2173-8
Type
conf
DOI
10.1109/BIBE.2004.1317368
Filename
1317368
Link To Document