• DocumentCode
    2038258
  • Title

    Towards scalable optimal sequence homology detection

  • Author

    Daily, Jeff ; Krishnamoorthy, Sriram ; Kalyanaraman, Ananth

  • Author_Institution
    Pacific Northwest Nat. Lab., Richland, WA, USA
  • fYear
    2012
  • fDate
    18-22 Dec. 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The field of bioinformatics and computational biology is experiencing a data revolution - experimental techniques to procure data have increased in throughput improved in accuracy and reduced in costs. This has spurred an array of high profile sequencing and data generation projects. While the data repositories represent untapped reservoirs of rich information critical for scientific breakthroughs the analytical software tools that are needed to analyze large volumes of such sequence data have significantly lagged behind in their capacity to scale. In this paper we address homology detection which is a fundamental problem in large-scale sequence analysis with numerous applications. We present a scalable framework to conduct large-scale optimal homology detection on massively parallel super-computing platforms. Our approach employs distributed memory work stealing to effectively parallelize optimal pairwise alignment computation tasks. Results on 120,000 cores of the Hopper Cray XE6 supercomputer demonstrate strong scaling and up to 2.42 × 107 optimal pairwise sequence alignments computed per second (PSAPS) the highest reported in the literature.
  • Keywords
    Cray computers; DNA; bioinformatics; distributed memory systems; mainframes; parallel machines; pattern recognition; Hopper Cray XE6 supercomputer; analytical software tools; bioinformatics; computational biology; data repositories; distributed memory work stealing; large-scale sequence analysis; massively parallel super-computing platforms; optimal pairwise alignment computation task parallelism; pairwise sequence alignments computed per second; scalable optimal sequence homology detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing (HiPC), 2012 19th International Conference on
  • Conference_Location
    Pune
  • Print_ISBN
    978-1-4673-2372-7
  • Electronic_ISBN
    978-1-4673-2370-3
  • Type

    conf

  • DOI
    10.1109/HiPC.2012.6507523
  • Filename
    6507523