• DocumentCode
    3410947
  • Title

    DASH: localising dynamic programming for order of magnitude faster, accurate sequence alignment

  • Author

    Gardner-Stephen, Paul ; Knowles, Greg

  • Author_Institution
    Flinders Univ. of South Australia, Adelaide, SA, Australia
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    732
  • Lastpage
    735
  • Abstract
    In this paper we present our genomic and proteomic sequence alignment algorithm, DASH, which results in order of magnitude speed improvement when compared to NCBI-BLAST 2.2.6, with superior sensitivity. Dynamic programming (DP) is the predominant contributor to search time for algorithms such as BLAST and FastA/P. Improving the efficiency of DP provides an opportunity to increase sensitivity, or significantly reduce search times and help offset the effects of the continuing exponential growth in database sizes. Specifically, for nucleotide searching we have demonstrated an order of magnitude speed improvement with significantly improved sensitivity, or alternatively moderate speed up with further sensitivity gains, depending on the parameters selected. Smith-Waterman complete DP is used as the sensitivity benchmark. Similar speed and sensitivity results are presented for protein searching. Since our algorithm is highly parallel, we have developed dedicated hardware which we will present in a companion paper, and a distributed version of our software (DDASH), which we expect to provide linear speedup on a cluster.
  • Keywords
    biology computing; dynamic programming; genetics; molecular biophysics; proteins; search problems; DASH; FastA/P; NCBI-BLAST 2.2.6; Smith-Waterman complete DP; dedicated hardware; distributed DASH; dynamic programming; fast accurate sequence alignment; genomic sequence alignment algorithm; nucleotide searching; protein searching; proteomic sequence alignment algorithm; search time; Bioinformatics; Clustering algorithms; Databases; Dynamic programming; Embedded system; Genomics; Heuristic algorithms; Informatics; Laboratories; Proteomics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332562
  • Filename
    1332562