DocumentCode :
244947
Title :
Fast and Exact Monitoring of Co-Evolving Data Streams
Author :
Matsubara, Yasuko ; Sakurai, Yasushi ; Ueda, Naonori ; Yoshikawa, Masatoshi
fYear :
2014
fDate :
14-17 Dec. 2014
Firstpage :
390
Lastpage :
399
Abstract :
Given a huge stream of multiple co-evolving sequences, such as motion capture and web-click logs, how can we find meaningful patterns and spot anomalies? Our aim is to monitor data streams statistically, and find sub sequences that have the characteristics of a given hidden Markov model (HMM). For example, consider an online web-click stream, where massive amounts of access logs of millions of users are continuously generated every second. So how can we find meaningful building blocks and typical access patterns such as weekday/weekend patterns, and also, detect anomalies and intrusions? In this paper, we propose Stream Scan, a fast and exact algorithm for monitoring multiple co-evolving data streams. Our method has the following advantages: (a) it is effective, leading to novel discoveries and surprising outliers, (b) it is exact, and we theoretically prove that Stream Scan guarantees the exactness of the output, (c) it is fast, and requires O (1) time and space per time-tick. Our experiments on 67GB of real data illustrate that Stream Scan does indeed detect the qualifying subsequence patterns correctly and that it can offer great improvements in speed (up to 479,000 times) over its competitors.
Keywords :
computational complexity; hidden Markov models; media streaming; HMM; StreamScan; access pattern detection; co-evolving data stream monitoring; hidden Markov model; Algorithm design and analysis; Heuristic algorithms; Hidden Markov models; Legged locomotion; Monitoring; Periodic structures; Viterbi algorithm; Co-evolving data streams; Hidden Markov models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
ISSN :
1550-4786
Print_ISBN :
978-1-4799-4303-6
Type :
conf
DOI :
10.1109/ICDM.2014.62
Filename :
7023356
Link To Document :
بازگشت