Title :
Hardware-Accelerated Regular Expression Matching with Overlap Handling on IBM PowerEN Processor
Author :
Atasu, Kubilay ; Doerfler, Florian ; van Lunteren, Jan ; Hagleitner, Christoph
Author_Institution :
IBM Res. - Zurich, Zurich, Switzerland
Abstract :
Programmable hardware accelerators for regular expression (regex) matching are evolving into increasingly complex stream processors, which involve multiple state machines that operate in parallel, and specialized post-processors that can process instructions dispatched by the state machines. To improve the speed and the storage-efficiency, complex regexs are decomposed into simpler subexpressions, where each subexpression can fire one or more instructions. Although the impact of regex decompositions on the storage efficiency is well-known, little has been done to address the correctness and completeness. We show that regex decompositions can result in false positives if overlaps between subexpressions are not taken into account. We describe formal methods to recognize various types of subexpression overlaps that can arise in regex decompositions. We also describe efficient post-processing techniques to eliminate the associated false positives. To enable efficient mapping of the decomposed regexs to the postprocessors, we propose integer programming based register allocation methods. Our methods pack narrow variables to reduce the register and instruction usage, and take advantage of multi-register reset instructions to reduce the number of instructions that must be executed in parallel. Experiments on regex sets obtained from open-source and proprietary network intrusion detection systems demonstrate orders of magnitude improvement in the storage efficiency over state-of-the-art.
Keywords :
finite state machines; integer programming; microprocessor chips; IBM PowerENTM Processor; complex stream processors; formal methods; hardware accelerated regular expression matching; integer programming; multiple state machines; overlap handling; post-processing techniques; programmable hardware accelerators; register allocation methods; storage efficiency; Automata; Doped fiber amplifiers; Hardware; Pattern matching; Radiation detectors; Registers; Throughput; automata theory; compilers; hardware accelerators; integer programming; regular expression matching;
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4673-6066-1
DOI :
10.1109/IPDPS.2013.54