Title :
Good spaced seeds for homology search
Author :
Choi, Kwok Pui ; Zeng, Fanfan ; Zhang, Louxin
Author_Institution :
Dept. of Math., Nat. Univ. of Singapore, Singapore
Abstract :
Filtration is an important technique used to speed up local alignment as exemplified in the BLAST programs. Recently, Ma, Tromp and Li (2002) discovered that better filtering can be achieved by spacing out the matching positions according to a certain pattern, instead of contiguous positions to trigger a local alignment in their PatternHunter program. Such a match pattern is called a spaced seed. Our numerical computation shows that the ranks of spaced seeds (based on sensitivity) change with the sequences similarity. Since homologous sequences may have diverse similarity, we assess the sensitivity of spaced seeds over a range of similarity levels and present a list of good spaced seeds for facilitating homology search in DNA genomic sequences. We validate that the listed spaced seeds are indeed more sensitive using three arbitrarily chosen pairs of DNA genomic sequences.
Keywords :
DNA; biology computing; database management systems; genetics; molecular biophysics; proteins; query processing; sequences; DNA genomic sequences; PatternHunter program; filtration; homologous sequences; homology; similarity; spaced seeds; Bioinformatics; Biological cells; DNA; Filtration; Genomics; Mathematics; Pattern matching; Probability; Sequences; Statistics;
Conference_Titel :
Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
Print_ISBN :
0-7695-2173-8
DOI :
10.1109/BIBE.2004.1317368