DocumentCode :
9232
Title :
Geometric Monitoring of Heterogeneous Streams
Author :
Keren, Doron ; Sagy, Guy ; Abboud, A. ; Ben-David, David ; Schuster, Assaf ; Sharfman, Izchak ; Deligiannakis, Antonios
Author_Institution :
Dept. of Comput. Sci., Haifa Univ., Haifa, Israel
Volume :
26
Issue :
8
fYear :
2014
fDate :
Aug. 2014
Firstpage :
1890
Lastpage :
1903
Abstract :
Interest in stream monitoring is shifting toward the distributed case. In many applications the data is high volume, dynamic, and distributed, making it infeasible to collect the distinct streams to a central node for processing. Often, the monitoring problem consists of determining whether the value of a global function, defined on the union of all streams, crossed a certain threshold. We wish to reduce communication by transforming the global monitoring to the testing of local constraints, checked independently at the nodes. Geometric monitoring (GM) proved useful for constructing such local constraints for general functions. Alas, in GM the constraints at all nodes share an identical structure and are thus unsuitable for handling heterogeneous streams. Therefore, we propose a general approach for monitoring heterogeneous streams (HGM), which defines constraints tailored to fit the data distributions at the nodes. While we prove that optimally selecting the constraints is NP-hard, we provide a practical solution, which reduces the running time by hierarchically clustering nodes with similar data distributions and then solving simpler optimization problems. We also present a method for efficiently recovering from local violations at the nodes. Experiments yield an improvement of over an order of magnitude in communication relative to GM.
Keywords :
computational complexity; data handling; optimisation; pattern clustering; GM; HGM; NP-hard constraints; data distributions; distributed case; geometric monitoring; global function; heterogeneous streams; hierarchical node clustering; local constraints; optimization problems; stream monitoring; Correlation; Data models; Distributed databases; Monitoring; Nickel; Optimization; Vectors; Data Streams; Distributed Streams; Geometric Monitoring; Heterogeneous data streams; data modeling; distributed streams; geometric monitoring; safe zones;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2013.180
Filename :
6678505
Link To Document :
بازگشت