DocumentCode
29287
Title
Protein Sequence Pattern Matching: Leveraging Application Specific Hardware Accelerators
Author
Manole, S. ; Golander, A. ; Weiss, Steven
Author_Institution
IBM Haifa Res. Labs., Tel-Aviv Univ., Tel-Aviv, Israel
Volume
63
Issue
2
fYear
2014
fDate
Feb. 2014
Firstpage
448
Lastpage
460
Abstract
Digitalization has brought a tremendous momentum to health care research. Recognition of patterns in proteins is crucial for identifying possible functions of newly discovered proteins, as well as analysis of known proteins for previously undetermined activity. In this paper, the workload consists of locating patterns from the PROSITE database in protein sequences. We optimize the pattern search task by using a new breed of processors that merge network and server attributes. We leverage massive multithreading and regular-expression (RegX) hardware accelerators; the latter were designed and built for an entirely different application - high-bandwidth deep-packet inspection. Our multithreading optimization achieves 18x improvement, but by harnessing a RegX accelerator we were able to further demonstrate a significant 392x improvement relative to software pattern matching. Moreover, performance per area and power consumption are improved by multiple orders of magnitude as well.
Keywords
biology computing; multi-threading; pattern matching; proteins; PROSITE database; application specific hardware accelerators; health care research; high-bandwidth deep-packet inspection; multithreading; network attributes; pattern recognition; power consumption; protein discovery; protein sequence pattern matching; regular-expression hardware accelerators; server attributes; software pattern matching; Hardware; Instruction sets; Multithreading; Pattern matching; Processor scheduling; Proteins; CMP; Hardware; Instruction sets; Multithreading; Pattern matching; Processor scheduling; Protein sequence; Proteins; SMT; hardware accelerator; multithreading; pattern matching;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2012.187
Filename
6257365
Link To Document