DocumentCode
433418
Title
Fast pattern detection in stream data
Author
Sheu, Simon ; Cheng, Chang-Yeng ; Chang, Alan
Author_Institution
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Volume
1
fYear
2005
fDate
28-30 March 2005
Firstpage
125
Abstract
Digital pollution is emerging as an overwhelming threat to the Internet, whose ubiquitous connectivity conversely cultivates the widespread outbreaks of such dirt. Considerable amount of human efforts and network resources are wasted at a little cost of the few polluters. To prevent flooding of the contamination, classical string matching schemes and their variants can be used to detect these patterns for removal. The speed of detection is crucial to this application. In this paper, we propose a novel pattern detection technique based on the decision tree induction to seek for significant improvement over the classical schemes. According to the intrinsic of the pattern, the tree is sprouted adaptively to minimize the number of symbols in the data stream needs to be examined. This allows a unique order to inspect the symbols in a strategic way optimized contextually, as opposed to the fixed order followed by the other schemes. Performance study indicates our approach achieves the speed-up of five or more over the best competitors.
Keywords
Internet; decision trees; search problems; string matching; Internet; data stream; decision tree induction; digital pollution; fast pattern detection; string matching; Computer science; Costs; Decision trees; Floods; Humans; Intelligent networks; Internet; Intrusion detection; Pattern matching; Pollution;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Information Networking and Applications, 2005. AINA 2005. 19th International Conference on
ISSN
1550-445X
Print_ISBN
0-7695-2249-1
Type
conf
DOI
10.1109/AINA.2005.184
Filename
1423481
Link To Document