DocumentCode :
474869
Title :
Advanced algorithms for fast and scalable deep packet inspection
Author :
Kumar, Sailesh ; Turner, Jonathan ; Williams, John
Author_Institution :
Washington Univ., St. Louis, MO
fYear :
2006
fDate :
3-5 Dec. 2006
Firstpage :
81
Lastpage :
92
Abstract :
Modern deep packet inspection systems use regular expressions to define various patterns of interest in network data streams. Deterministic finite automata (DFA) are commonly used to parse regular expressions. DFAs are fast, but can require prohibitively large amounts of memory for patterns arising in network applications. Traditional DFA table compression only slightly reduces the memory required and requires an additional memory access per input character. Alternative representations of regular expressions, such as NFAs and delayed input DFAs (D2FA) require less memory but sacrifice throughput. In this paper we introduce the content addressed delayed input DFA (CD2FA), which provides a compact representation of regular expressions that match the throughput of traditional uncompressed DFAs. A CD2FA addresses successive states of a D2FA using their content, rather than a "content-less" identifier. This makes selected information available earlier in the state traversal process, which makes it possible to avoid unnecessary memory accesses. We demonstrate that such content-addressing can be effectively used to obtain automata that are very compact and can achieve high throughput. Specifically, we show that for an application using thousands of patterns defined by regular expressions, CD2FAs use as little as 10% of the space required by a conventional compressed DFA, and match the throughput of an uncompressed DFA.
Keywords :
computer networks; data compression; deterministic automata; finite automata; grammars; storage management; content addressed delayed input; content addressing; deep packet inspection; deterministic finite automata; expression representation; memory access; network applications; network data streams; regular expression parsing; state traversal process; table compression; Added delay; Automata; Computer security; Doped fiber amplifiers; Information security; Inspection; Operating systems; Pattern matching; Permission; Throughput; DFA; deep packet inspection; regular expressions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Architecture for Networking and Communications systems, 2006. ANCS 2006. ACM/IEEE Symposium on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-59593-580-9
Type :
conf
Filename :
4579526
Link To Document :
بازگشت