Title :
Kangaroo: Accelerating String Matching by Running Multiple Collaborative Finite State Machines
Author :
Xiaofei Wang ; Bin Liu ; Junchen Jiang ; Yang Xu ; Yi Wang ; Xiaojun Wang
Author_Institution :
Tsinghua Univ., Beijing, China
Abstract :
String matching is a key technique for network security applications such as network intrusion detection systems and antivirus scanners, where the payload of every packet is inspected against thousands of patterns in real time. As the transmission rate of Internet links is getting higher and higher, the speed of matching engines is required to be faster and faster. Existing deterministic finite automaton (DFA)-based approaches achieve high throughput at the expense of extremely expensive memory cost; therefore, they are not suitable for the scenarios where only limited on-chip memory resources are available. To achieve fast matching speed while controlling memory expense, in this paper, we propose Kangaroo, a compact string matching scheme that scans multiple characters each time by running multiple small-sized finite state machines in parallel. Specifically, Kangaroo processes k consecutive characters mostly in one cycle by accessing k different memories in parallel, where k is a predefined factor that can be tuned based on the requirement of applications. Kangaroo is memory efficient. Experimental evaluations on Snort and ClamAV rule sets show that a tenfold increase in speed can be practically achieved by a single Kangaroo matching engine with a reduced memory cost comparing with the state-of-the-art DFA-based approaches.
Keywords :
deterministic automata; finite state machines; string matching; ClamAV; DFA; Internet links; Kangaroo scheme; Snort; antivirus scanners; collaborative finite state machines; deterministic finite automaton; network intrusion detection system; on-chip memory resources; string matching; Automata; Buildings; Collaboration; Educational institutions; Engines; Silicon; Throughput; Deep packet inspection; deterministic finite automaton; finite state machine;
Journal_Title :
Selected Areas in Communications, IEEE Journal on
DOI :
10.1109/JSAC.2014.2358841