DocumentCode
1255837
Title
Aho-Corasick String Matching on Shared and Distributed-Memory Parallel Architectures
Author
Tumeo, Antonino ; Villa, Oreste ; Chavarría-Miranda, Daniel G.
Author_Institution
High Performance Comput. Group, Pacific Northwest Nat. Lab. (PNNL), Richland, WA, USA
Volume
23
Issue
3
fYear
2012
fDate
3/1/2012 12:00:00 AM
Firstpage
436
Lastpage
443
Abstract
String matching requires a combination of (sometimes all) the following characteristics: high and/or predictable performance, support for large data sets and flexibility of integration and customization. This paper compares several software-based implementations of the Aho-Corasick algorithm for high-performance systems. We focus on the matching of unknown inputs streamed from a single source, typical of security applications and difficult to manage since the input cannot be preprocessed to obtain locality. We consider shared-memory architectures (Niagara 2, x86 multiprocessors, and Cray XMT) and distributed-memory architectures with homogeneous (InfiniBand cluster of x86 multicores) or heterogeneous processing elements (InfiniBand cluster of x86 multicores with NVIDIA Tesla C1060 GPUs). We describe how each solution achieves the objectives of supporting large dictionaries, sustaining high performance, and enabling customization and flexibility using various data sets.
Keywords
distributed shared memory systems; parallel architectures; string matching; Aho-Corasick string matching; Cray XMT; InfiniBand cluster; NVIDIA Tesla C1060 GPU; Niagara 2; distributed-memory parallel architectures; heterogeneous processing elements; high-performance systems; homogeneous processing elements; shared-memory parallel architectures; x86 multiprocessors; Algorithm design and analysis; Computer architecture; Dictionaries; Doped fiber amplifiers; Graphics processing unit; Instruction sets; Aho-Corasick; Cray XMT; GPGPU; high-performance computing.; multithreaded architectures; string matching;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2011.181
Filename
5928330
Link To Document