• DocumentCode
    2826211
  • Title

    A fast algorithm for multi-string matching based on automata optimization

  • Author

    Hu, Yue ; Wang, Pei-Feng ; Hwang, Kai

  • Author_Institution
    Univ. of Sci. & Technol. Beijing, Beijing, China
  • Volume
    2
  • fYear
    2010
  • fDate
    21-24 May 2010
  • Abstract
    Multi-string matching requires to handle massive amount of data in pattern recognition, intrusion detection, and biological sequence analysis applications. This paper proposes a new algorithm to construct an optimal automation to achieve fast string matching. The algorithm consists of five steps: sorting, forming subtrees, encoding all subtrees, similarity checking, and completing all transitions. The algorithmic complexity is proven O(umk), where u is the number of the symbols in the alphabet set and m and k are the average length and the number of strings being matched. We report analytical results on the matching complexity. These results prove the efficiency and effectiveness of the optimized automata generated for fast matching of multiple strings.
  • Keywords
    computational complexity; encoding; sorting; string matching; trees (mathematics); algorithmic complexity; automata optimization; biological sequence analysis application; intrusion detection; multistring matching; optimal automation; pattern recognition; similarity checking; subtrees encoding; subtrees forming; subtrees sorting; transitions completing; Algorithm design and analysis; Automata; Automation; Biological information theory; Encoding; Hardware; Intrusion detection; Pattern matching; Pattern recognition; Sorting; String matching; algorithm complexity; automata theory; graph theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Computer and Communication (ICFCC), 2010 2nd International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-5821-9
  • Type

    conf

  • DOI
    10.1109/ICFCC.2010.5497448
  • Filename
    5497448