DocumentCode :
3166147
Title :
Efficient Invariant Search for Distributed Information Systems
Author :
Yong Ge ; Guofei Jiang ; Yuan Ge
Author_Institution :
Univ. of North Carolina at Charlotte, Charlotte, NC, USA
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
1049
Lastpage :
1054
Abstract :
In today´s distributed information systems, a large amount of monitoring data such as log files have been collected. These monitoring data at various points of a distributed information system provide unparallel opportunities for us to characterize and track the information system via effectively correlating all monitoring data across the distributed system. Jiang1 proposed a concept named flow intensity to measure the intensity with which the monitoring data reacts to the volume of different user requests. The Autoregressive model with exogenous inputs (ARX) was used to quantify the relationship between each pair of flow intensity measured at various points across distributed systems. If such relationships hold all the time, they are considered as invariants of the underlying systems. Such invariants have been successfully used to characterize complex systems and support various system management tasks, such as system fault detection and localization. However, it is very time-consuming to search the complete set of invariants of large scale systems and existing algorithms are not scalable for thousands of flow intensity measurements. To this end, in this paper, we develop effective pruning techniques based on the identified upper bounds. Accordingly, two efficient algorithms are proposed to search the complete set of invariants based on the pruning techniques. Finally we demonstrate the efficiency and effectiveness of our algorithms with both real-world and synthetic data sets.
Keywords :
autoregressive processes; distributed processing; information systems; ARX; autoregressive model with exogenous inputs; distributed information systems; flow intensity measurements; information system characterization; information system tracking; invariant search; monitoring data; pruning technique; real-world data set; synthetic data set; Computational modeling; Distributed information systems; Equations; Mathematical model; Monitoring; Time series analysis; Upper bound; ARX Model; AutoRegressive Model; Efficient Search; Invariant;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2013.133
Filename :
6729596
Link To Document :
بازگشت