DocumentCode
2826211
Title
A fast algorithm for multi-string matching based on automata optimization
Author
Hu, Yue ; Wang, Pei-Feng ; Hwang, Kai
Author_Institution
Univ. of Sci. & Technol. Beijing, Beijing, China
Volume
2
fYear
2010
fDate
21-24 May 2010
Abstract
Multi-string matching requires to handle massive amount of data in pattern recognition, intrusion detection, and biological sequence analysis applications. This paper proposes a new algorithm to construct an optimal automation to achieve fast string matching. The algorithm consists of five steps: sorting, forming subtrees, encoding all subtrees, similarity checking, and completing all transitions. The algorithmic complexity is proven O(umk), where u is the number of the symbols in the alphabet set and m and k are the average length and the number of strings being matched. We report analytical results on the matching complexity. These results prove the efficiency and effectiveness of the optimized automata generated for fast matching of multiple strings.
Keywords
computational complexity; encoding; sorting; string matching; trees (mathematics); algorithmic complexity; automata optimization; biological sequence analysis application; intrusion detection; multistring matching; optimal automation; pattern recognition; similarity checking; subtrees encoding; subtrees forming; subtrees sorting; transitions completing; Algorithm design and analysis; Automata; Automation; Biological information theory; Encoding; Hardware; Intrusion detection; Pattern matching; Pattern recognition; Sorting; String matching; algorithm complexity; automata theory; graph theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Future Computer and Communication (ICFCC), 2010 2nd International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-5821-9
Type
conf
DOI
10.1109/ICFCC.2010.5497448
Filename
5497448
Link To Document