DocumentCode
2778969
Title
Shift-based pattern matching for compressed web traffic
Author
Bremler-Barr, Anat ; Koral, Yaron ; Zigdon, Victor
Author_Institution
Comput. Sci. Dept., Interdiscipl. Center, Herzliya, Israel
fYear
2011
fDate
4-6 July 2011
Firstpage
222
Lastpage
229
Abstract
Compressing web traffic using standard GZIP is becoming both popular and challenging due to the huge increase in wireless web devices, where bandwidth is limited. Security and other content based networking devices are required to decompress the traffic of tens of thousands concurrent connections in order to inspect the content for different signatures. The overhead imposed by the decompression inhibits most devices from handling compressed traffic, which in turn either limits traffic compression or introduces security holes and other dysfunctionalities. The ACCH algorithm was the first to present a unified approach to pattern matching and decompression, by taking advantage of information gathered in the decompression phase to accelerate the pattern matching. ACCH accelerated the DFA-based Aho-Corasick multi-pattern matching algorithm. In this paper, we present a novel algorithm, SPC (Shift-based Pattern matching for Compressed traffic) that accelerates the commonly used Wu-Manber pattern matching algorithm. SPC is simpler and has higher throughput and lower storage overhead than ACCH. Analysis of real web traffic and real security devices signatures shows that we can skip scanning up to 87.5% of the data and gain performance boost of more than 51% as compared to ACCH. Moreover, the additional storage requirement of the technique requires only 4KB additional information per connection as compared to 8KB of ACCH.
Keywords
Internet; cellular radio; data compression; hypermedia; pattern matching; telecommunication security; telecommunication traffic; transport protocols; ACCH algorithm; DFA- based Aho-Corasick multipattern matching algorithm; SPC; Wu-Manber pattern matching algorithm; compressed Web traffic; concurrent connection; content based networking device; real security device signature; shift-based pattern matching; standard GZIP; storage requirement; wireless Web device; Acceleration; Algorithm design and analysis; Huffman coding; Inspection; Pattern matching; Security;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Switching and Routing (HPSR), 2011 IEEE 12th International Conference on
Conference_Location
Cartagena
Print_ISBN
978-1-4244-8454-6
Electronic_ISBN
978-1-4244-8455-3
Type
conf
DOI
10.1109/HPSR.2011.5986030
Filename
5986030
Link To Document