DocumentCode :
2483784
Title :
Input-independent, scalable and fast string matching on the Cray XMT
Author :
Villa, Oreste ; Chavarría-Miranda, Daniel ; Maschhoff, Kristyn
Author_Institution :
High-Performance Comput., Pacific Northwest Nat. Lab., Richland, WA, USA
fYear :
2009
fDate :
23-29 May 2009
Firstpage :
1
Lastpage :
12
Abstract :
String searching is at the core of many security and network applications like search engines, intrusion detection systems, virus scanners and spam filters. The growing size of on-line content and the increasing wire speeds push the need for fast, and often real-time, string searching solutions. For these conditions, many software implementations (if not all) targeting conventional cache-based microprocessors do not perform well. They either exhibit overall low performance or exhibit highly variable performance depending on the types of inputs. For this reason, real-time state of the art solutions rely on the use of either custom hardware or Field-Programmable Gate Arrays (FPGAs) at the expense of overall system flexibility and programmability. This paper presents a software based implementation of the Aho-Corasick string searching algorithm on the Cray XMT multithreaded shared memory machine. Our solution relies on the particular features of the XMT architecture and on several algorithmic strategies: it is fast, scalable and its performance is virtually content-independent. On a 128-processor Cray XMT, it reaches a scanning speed of ap 28 Gbps with a performance variability below 10%. In the 10 Gbps performance range, variability is below 2.5%. By comparison, an Intel dual-socket, 8-core system running at 2.66 GHz achieves a peak performance which varies from 500 Mbps to 10 Gbps depending on the type of input and dictionary size.
Keywords :
field programmable gate arrays; multi-threading; security of data; shared memory systems; string matching; Cray XMT; fast string matching; field-programmable gate arrays; input-independent matching; multithreaded shared memory machine; online content; scalable matching; Application software; Field programmable gate arrays; Filters; Hardware; Intrusion detection; Microprocessors; Real time systems; Search engines; Software performance; Wire;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location :
Rome
ISSN :
1530-2075
Print_ISBN :
978-1-4244-3751-1
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2009.5161043
Filename :
5161043
Link To Document :
بازگشت