• DocumentCode
    3433271
  • Title

    Generalized Aho-Corasick Algorithm for Signature Based Anti-Virus Applications

  • Author

    Lee, Tsern-Huei

  • Author_Institution
    Nat. Chiao Tung Univ., Hsinchu
  • fYear
    2007
  • fDate
    13-16 Aug. 2007
  • Firstpage
    792
  • Lastpage
    797
  • Abstract
    Because of its accuracy, signature matching is considered an important technique in anti-virus/worm applications. Among some famous pattern matching algorithms, the Aho-Corasick (AC) algorithm can match multiple patterns simultaneously and guarantee deterministic performance under all circumstances and thus is widely adopted in various systems, especially when worst-case performance such as wire speed requirement is a design factor. However, the AC algorithm was developed only for strings while virus/worm signatures could be specified by simple regular expressions. In this paper, we generalize the AC algorithm to systematically construct a finite state pattern matching machine which can indicate the ending position in a finite input string for the first occurrence of virus/worm signatures that are specified by strings or simple regular expressions. The regular expressions studied in this paper may contain the following operators: * (match any number of symbols), ? (match any symbol), and {min, max} (match minimum of min, maximum of max symbols), which are defined in ClamAV, a popular open source anti-virus/worm software module, for signature specification.
  • Keywords
    computer viruses; finite state machines; pattern matching; Aho-Corasick algorithm; ClamAV; antivirus/worm software module; pattern matching; signature matching; Algorithm design and analysis; Automata; Doped fiber amplifiers; Internet; Open source software; Pattern analysis; Pattern matching; Protocols; Viruses (medical); Wire;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications and Networks, 2007. ICCCN 2007. Proceedings of 16th International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1095-2055
  • Print_ISBN
    978-1-4244-1251-8
  • Electronic_ISBN
    1095-2055
  • Type

    conf

  • DOI
    10.1109/ICCCN.2007.4317914
  • Filename
    4317914