Title :
A fast algorithm for multi-string matching based on automata optimization
Author :
Hu, Yue ; Wang, Pei-Feng ; Hwang, Kai
Author_Institution :
Univ. of Sci. & Technol. Beijing, Beijing, China
Abstract :
Multi-string matching requires to handle massive amount of data in pattern recognition, intrusion detection, and biological sequence analysis applications. This paper proposes a new algorithm to construct an optimal automation to achieve fast string matching. The algorithm consists of five steps: sorting, forming subtrees, encoding all subtrees, similarity checking, and completing all transitions. The algorithmic complexity is proven O(umk), where u is the number of the symbols in the alphabet set and m and k are the average length and the number of strings being matched. We report analytical results on the matching complexity. These results prove the efficiency and effectiveness of the optimized automata generated for fast matching of multiple strings.
Keywords :
computational complexity; encoding; sorting; string matching; trees (mathematics); algorithmic complexity; automata optimization; biological sequence analysis application; intrusion detection; multistring matching; optimal automation; pattern recognition; similarity checking; subtrees encoding; subtrees forming; subtrees sorting; transitions completing; Algorithm design and analysis; Automata; Automation; Biological information theory; Encoding; Hardware; Intrusion detection; Pattern matching; Pattern recognition; Sorting; String matching; algorithm complexity; automata theory; graph theory;
Conference_Titel :
Future Computer and Communication (ICFCC), 2010 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5821-9
DOI :
10.1109/ICFCC.2010.5497448