• DocumentCode
    1641121
  • Title

    Optimizing performance, cost, and sensitivity in pairwise sequence search on a cluster of PlayStations

  • Author

    Aji, Ashwin M. ; Feng, Wu-chun

  • Author_Institution
    Dept. of Comput. Sci., Virginia Tech, Blacksburg, VA
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The Smith-Waterman algorithm is a dynamic programming method for determining optimal local alignments between nucleotide or protein sequences. However, it suffers from quadratic time and space complexity. As a result, many algorithmic and architectural enhancements have been proposed to solve this problem, but at the cost of reduced sensitivity in the algorithms or significant expense in hardware, respectively. This paper presents a highly efficient parallelization of the Smith-Waterman algorithm on the Cell Broadband Engine platform, a novel hybrid multicore architecture that drives the low-cost PlayStation 3 (PS3) game consoles as well as the IBM BladeCenter Q22, which currently powers the fastest supercomputer in the world, Roadrunner at Los Alamos National Laboratory. Through an innovative mapping of the optimal Smith-Waterman algorithm onto a cluster of PlayStation 3 nodes, our implementation delivers 21 to 55-fold speed-up over a high-end multicore architecture and up to 449-fold speed-up over the PowerPC processor in the PS3. Next, we evaluate the trade-offs between our Smith- Waterman implementation on the Cell with existing software and hardware implementations and show that our solution achieves the best performance-to-price ratio, when aligning realistic sequences sizes and generating the actual alignment. Finally, we show that our low-cost solution on a PS3 cluster approaches the speed of BLAST while achieving ideal sensitivity. To quantify the relationship between the two algorithms in terms of speed and sensitivity, we formally define and quantify the sensitivity of homology search methods so that trade-offs between sequence-search solutions can be evaluated in a quantitative manner.
  • Keywords
    bioinformatics; dynamic programming; parallel algorithms; parallel machines; proteins; BLAST; Cell Broadband Engine platform; Los Alamos National Laboratory; PS3 game console; PlayStations cluster; Roadrunner supercomputer; Smith-Waterman algorithm; cost optimization; dynamic programming; nucleotide sequences; pairwise sequence search; parallelization; performance optimization; protein sequences; sensitivity optimization; Clustering algorithms; Cost function; Dynamic programming; Engines; Hardware; Heuristic algorithms; Laboratories; Multicore processing; Proteins; Supercomputers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4244-2844-1
  • Electronic_ISBN
    978-1-4244-2845-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2008.4696723
  • Filename
    4696723