Title :
PaREM: A Novel Approach for Parallel Regular Expression Matching
Author :
Memeti, Suejb ; Pllana, Sabri
Author_Institution :
Dept. of Comput. Sci., Linnaeus Univ., Vaxjo, Sweden
Abstract :
Regular expression matching is essential for many applications, such as finding patterns in text, exploring substrings in large DNA sequences, or lexical analysis. However, sequential regular expression matching may be time-prohibitive for large problem sizes. In this paper, we describe a novel algorithm for parallel regular expression matching via deterministic finite automata. Furthermore, we present our tool PaREM that accepts regular expressions and finite automata as input and automatically generates the corresponding code for our algorithm that is amenable for parallel execution on shared-memory systems. We evaluate our parallel algorithm empirically by comparing it with a commonly used algorithm for sequential regular expression matching. Experiments on a dual-socket shared-memory system with 24 physical cores show speed-ups of up to 21× for 48 threads.
Keywords :
finite automata; parallel processing; shared memory systems; PaREM; deterministic finite automata; dual-socket shared-memory system; parallel regular expression matching; sequential regular expression matching; Abstracts; Automata; Instruction sets; Multicore processing; Parallel algorithms; Sparse matrices; Syntactics; finite automata; multi-core; parallel processing; regular expression;
Conference_Titel :
Computational Science and Engineering (CSE), 2014 IEEE 17th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4799-7980-6
DOI :
10.1109/CSE.2014.146