• DocumentCode
    3598669
  • Title

    Good spaced seeds for homology search

  • Author

    Choi, Kwok Pui ; Zeng, Fanfan ; Zhang, Louxin

  • Author_Institution
    Dept. of Math., Nat. Univ. of Singapore, Singapore
  • fYear
    2004
  • Firstpage
    379
  • Lastpage
    386
  • Abstract
    Filtration is an important technique used to speed up local alignment as exemplified in the BLAST programs. Recently, Ma, Tromp and Li (2002) discovered that better filtering can be achieved by spacing out the matching positions according to a certain pattern, instead of contiguous positions to trigger a local alignment in their PatternHunter program. Such a match pattern is called a spaced seed. Our numerical computation shows that the ranks of spaced seeds (based on sensitivity) change with the sequences similarity. Since homologous sequences may have diverse similarity, we assess the sensitivity of spaced seeds over a range of similarity levels and present a list of good spaced seeds for facilitating homology search in DNA genomic sequences. We validate that the listed spaced seeds are indeed more sensitive using three arbitrarily chosen pairs of DNA genomic sequences.
  • Keywords
    DNA; biology computing; database management systems; genetics; molecular biophysics; proteins; query processing; sequences; DNA genomic sequences; PatternHunter program; filtration; homologous sequences; homology; similarity; spaced seeds; Bioinformatics; Biological cells; DNA; Filtration; Genomics; Mathematics; Pattern matching; Probability; Sequences; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
  • Print_ISBN
    0-7695-2173-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2004.1317368
  • Filename
    1317368