DocumentCode :
914386
Title :
An efficient algorithm for matching multiple patterns
Author :
Fan, Jang-Jong ; Su, Keh-Yih
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Volume :
5
Issue :
2
fYear :
1993
fDate :
4/1/1993 12:00:00 AM
Firstpage :
339
Lastpage :
351
Abstract :
An efficient algorithm for performing multiple pattern matching in a string is described. The algorithm combines the concept of deterministic finite state automata (DFSA) with the Boyer-Moore algorithm to achieve better performance. Experimental results indicate that, in the average case, the algorithm performs pattern matching sublinearly, i.e., it does not need to inspect every character of the string. The analysis shows that the number of characters inspected decreases as the length of the patterns increases, and increases slightly as the total number of patterns increases. To match a single eight-character pattern in an English string, only about 17% of all characters of the string are inspected; when the number of patterns is seven, about 33% are inspected. In an actual test, the algorithm running on a SUN 3/160 takes only 3.7 s to search for seven eight-character patterns in a 1.4-Mbyte English text file.
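The sublinear behaviour described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it is a simplified Set Horspool-style matcher, swapped in because it pairs an automaton (here a trie of reversed patterns) with a Boyer-Moore-style shift table. It assumes all patterns have the same length, and the function name `set_horspool_search` and its return shape are hypothetical, chosen only for this illustration.

```python
def set_horspool_search(text, patterns):
    """Illustrative multi-pattern search (assumes equal-length patterns).

    Returns (matches, inspected), where matches is a list of
    (start_index, pattern) and inspected is the number of distinct
    text positions that were examined.
    """
    m = len(patterns[0])
    if m == 0 or any(len(p) != m for p in patterns):
        raise ValueError("this sketch assumes non-empty, equal-length patterns")

    # Trie of reversed patterns: the window is scanned right to left.
    trie = {}
    for p in patterns:
        node = trie
        for c in reversed(p):
            node = node.setdefault(c, {})
        node[None] = p  # terminal marker; None never equals a text character

    # Horspool-style shift: distance from the rightmost occurrence of a
    # character (excluding the last pattern position) to the pattern end,
    # minimised over all patterns. Characters not in the table shift by m.
    shift = {}
    for p in patterns:
        for j in range(m - 1):
            shift[p[j]] = min(shift.get(p[j], m), m - 1 - j)

    matches = []
    inspected = set()
    end = m - 1  # index of the last character of the current window
    while end < len(text):
        node = trie
        for i in range(end, end - m, -1):
            inspected.add(i)
            c = text[i]
            if c not in node:
                break
            node = node[c]
        if None in node:
            matches.append((end - m + 1, node[None]))
        end += shift.get(text[end], m)
    return matches, len(inspected)


if __name__ == "__main__":
    sample = "the quick brown fox jumps over the lazy dog"
    found, seen = set_horspool_search(sample, ["quick", "brown", "jumps"])
    print(found)
    print(f"inspected {seen} of {len(sample)} characters")
```

Because the shift table lets the window jump past characters that cannot end a match, many text positions are never read, which is the effect the abstract reports. The percentages quoted in the abstract apply to the authors' algorithm and test data, not to this sketch.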
Keywords :
finite automata; pattern recognition; word processing; Boyer-Moore algorithm; English string; English text file; SUN 3/160; deterministic finite state automata; eight-character patterns; multiple pattern match; Automata; Bonding; Computational complexity; Councils; Information retrieval; Inspection; Pattern analysis; Pattern matching; Sun; Testing;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.219740
Filename :
219740