  • DocumentCode
    29287
  • Title
    Protein Sequence Pattern Matching: Leveraging Application Specific Hardware Accelerators
  • Author
    Manole, S.; Golander, A.; Weiss, Steven
  • Author_Institution
    IBM Haifa Res. Labs., Tel-Aviv Univ., Tel-Aviv, Israel
  • Volume
    63
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    448
  • Lastpage
    460
  • Abstract
    Digitalization has brought tremendous momentum to health care research. Recognizing patterns in proteins is crucial for identifying possible functions of newly discovered proteins, as well as for analyzing known proteins for previously undetermined activity. In this paper, the workload consists of locating patterns from the PROSITE database in protein sequences. We optimize the pattern search task by using a new breed of processors that merge network and server attributes. We leverage massive multithreading and regular-expression (RegX) hardware accelerators; the latter were designed and built for an entirely different application: high-bandwidth deep-packet inspection. Our multithreading optimization achieves an 18x improvement, and by harnessing a RegX accelerator we were able to further demonstrate a 392x improvement relative to software pattern matching. Moreover, performance per area and power consumption are improved by multiple orders of magnitude as well.
  • Keywords
    biology computing; multi-threading; pattern matching; proteins; PROSITE database; application specific hardware accelerators; health care research; high-bandwidth deep-packet inspection; multithreading; network attributes; pattern recognition; power consumption; protein discovery; protein sequence pattern matching; regular-expression hardware accelerators; server attributes; software pattern matching; Hardware; Instruction sets; Processor scheduling; CMP; Protein sequence; SMT; hardware accelerator
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9340
  • Type
    jour
  • DOI
    10.1109/TC.2012.187
  • Filename
    6257365
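
The software baseline named in the abstract, locating PROSITE patterns in protein sequences, can be illustrated with a minimal sketch. This is not the authors' implementation and does not model the RegX accelerator; it only assumes the standard PROSITE pattern syntax (elements separated by `-`, `x` for any residue, `[...]` for allowed residues, `{...}` for excluded residues, `(n)` or `(n,m)` for repetition, `<` and `>` as sequence anchors). The function names `prosite_to_regex` and `find_matches` are hypothetical, and anchors written inside brackets (e.g. `[AC>]`) are not handled.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE pattern into a Python regular expression (sketch)."""
    pattern = pattern.strip().rstrip(".")  # PROSITE patterns end with a period
    parts = []
    for element in pattern.split("-"):
        prefix = suffix = ""
        if element.startswith("<"):        # pattern anchored at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):          # pattern anchored at the C-terminus
            suffix, element = "$", element[:-1]
        # Split an element into its core and an optional (n) or (n,m) repeat.
        m = re.fullmatch(r"([^()]+)(?:\((\d+)(?:,(\d+))?\))?", element)
        if m is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        core, low, high = m.groups()
        if core == "x":                    # any amino acid
            core = "."
        elif core.startswith("{") and core.endswith("}"):
            core = "[^" + core[1:-1] + "]"  # excluded residues
        repeat = ""
        if low is not None:
            repeat = "{" + low + ("," + high if high is not None else "") + "}"
        parts.append(prefix + core + repeat + suffix)
    return "".join(parts)


def find_matches(pattern: str, sequence: str) -> list[tuple[int, str]]:
    """Return (start, matched_text) for every match, including overlapping ones."""
    # A lookahead with a capture group reports matches that overlap each other.
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start(), m.group(1)) for m in regex.finditer(sequence)]


if __name__ == "__main__":
    # N-glycosylation site pattern from PROSITE (PS00001).
    print(prosite_to_regex("N-{P}-[ST]-{P}."))
    print(find_matches("N-{P}-[ST]-{P}.", "AANGSAA"))
```

Scanning every pattern against every sequence in this way is independent per (pattern, sequence) pair, which is the kind of workload that multithreading or a dedicated regular-expression engine can parallelize.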