• DocumentCode
    1996211
  • Title

    Fast String Matching with Overlapped Substring Classifier in Deep Packet Inspection Systems

  • Author

    Zhang Zhikai ; Zhao Youjian ; Yang Guanghui ; Zhang Xiaoping

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2010
  • fDate
    6-10 Dec. 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Traditional DFA based DPI (Deep Packet Inspection) string matching architectures either suffer from throughput bottleneck or unfeasible memory requirement, or both. Bloom Filter based schemes, on the other hand, only provide indefinite and unprecise match results. In this paper, we propose a novel string matching data structure called Overlapped Substring Classifier(OSC), which tries to compromise between these two ends. Instead of using incoming byte flow directly, we use OSC to extract the characteristic digest of the incoming string, which we demonstrate would be sufficient for locating a very small set of possible match, using DFA techniques. This type of match ambiguity and false-positive inaccuracy can be tuned to be negligible. The scheme is perfectly suitable for efficient and parallel hardware implementation, which makes ultra high performance and low memory usage simultaneously possible. A hardware architecture is also designed supporting single-threaded scanning rate of 10Gbp, with only moderate memory requirement and clock rate assumption.
  • Keywords
    computer network security; data structures; inspection; pattern classification; string matching; DFA techniques; DPI; Internet security; OSC; bloom filter based schemes; deep packet inspection systems; fast string matching; match ambiguity; overlapped substring classifier; string matching data structure; supporting single-threaded scanning; Bandwidth; Doped fiber amplifiers; Hardware; IEEE Communications Society; Matched filters; Memory management; Pattern matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global Telecommunications Conference (GLOBECOM 2010), 2010 IEEE
  • Conference_Location
    Miami, FL
  • ISSN
    1930-529X
  • Print_ISBN
    978-1-4244-5636-9
  • Electronic_ISBN
    1930-529X
  • Type

    conf

  • DOI
    10.1109/GLOCOM.2010.5683877
  • Filename
    5683877