• DocumentCode
    2778969
  • Title

    Shift-based pattern matching for compressed web traffic

  • Author

    Bremler-Barr, Anat ; Koral, Yaron ; Zigdon, Victor

  • Author_Institution
    Comput. Sci. Dept., Interdiscipl. Center, Herzliya, Israel
  • fYear
    2011
  • fDate
    4-6 July 2011
  • Firstpage
    222
  • Lastpage
    229
  • Abstract
    Compressing web traffic using standard GZIP is becoming both popular and challenging due to the huge increase in wireless web devices, where bandwidth is limited. Security and other content based networking devices are required to decompress the traffic of tens of thousands concurrent connections in order to inspect the content for different signatures. The overhead imposed by the decompression inhibits most devices from handling compressed traffic, which in turn either limits traffic compression or introduces security holes and other dysfunctionalities. The ACCH algorithm was the first to present a unified approach to pattern matching and decompression, by taking advantage of information gathered in the decompression phase to accelerate the pattern matching. ACCH accelerated the DFA-based Aho-Corasick multi-pattern matching algorithm. In this paper, we present a novel algorithm, SPC (Shift-based Pattern matching for Compressed traffic) that accelerates the commonly used Wu-Manber pattern matching algorithm. SPC is simpler and has higher throughput and lower storage overhead than ACCH. Analysis of real web traffic and real security devices signatures shows that we can skip scanning up to 87.5% of the data and gain performance boost of more than 51% as compared to ACCH. Moreover, the additional storage requirement of the technique requires only 4KB additional information per connection as compared to 8KB of ACCH.
  • Keywords
    Internet; cellular radio; data compression; hypermedia; pattern matching; telecommunication security; telecommunication traffic; transport protocols; ACCH algorithm; DFA- based Aho-Corasick multipattern matching algorithm; SPC; Wu-Manber pattern matching algorithm; compressed Web traffic; concurrent connection; content based networking device; real security device signature; shift-based pattern matching; standard GZIP; storage requirement; wireless Web device; Acceleration; Algorithm design and analysis; Huffman coding; Inspection; Pattern matching; Security;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Switching and Routing (HPSR), 2011 IEEE 12th International Conference on
  • Conference_Location
    Cartagena
  • Print_ISBN
    978-1-4244-8454-6
  • Electronic_ISBN
    978-1-4244-8455-3
  • Type

    conf

  • DOI
    10.1109/HPSR.2011.5986030
  • Filename
    5986030