DocumentCode :
2709029
Title :
Exploiting Local and Global Invariants for the Management of Large Scale Information Systems
Author :
Chen, Haifeng ; Cheng, Haibin ; Jiang, Guofei ; Yoshihira, Kenji
Author_Institution :
NEC Labs. America, Princeton, NJ
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
113
Lastpage :
122
Abstract :
This paper presents a data oriented approach to modeling the complex computing systems, in which an ensemble of correlation models are discovered to represent the system status. If the discovered correlations can continually hold under different user scenarios and workloads, they are regarded as invariants of the information system. In our previous work, we have developed an algorithm to automatically search the invariants between any pair of system attributes, which we call local invariants. However that method is unable to deal with the high order dependency models due to the combinatorial explosion of search space. In this paper we use Bayesian regression technique to discover those high order correlation models, called global invariants. We treat each attribute as a response variable in turn and express its dependency with the other attributes in a regression model. By adding the prior constraint of Laplacian distribution to the regression coefficients, we can find the solution in which only the correlated attributes with respect to the response have nonzero regression coefficients. After that we further consider the temporal dependencies of those extracted attributes by incorporating their past observations. We also provide a confidence metric and a validation procedure to measure the reliability of learned models. If the model does not break down in the validation, it is regarded as a true invariant of the system. Experimental results on a real wireless networking system show that the discovered invariants can be used to effectively detect system failures as well as provide valuable information about the failure source.
Keywords :
Bayes methods; Laplace transforms; data handling; regression analysis; Bayesian regression technique; Laplacian distribution; complex computing systems; confidence metric; large scale information systems management; wireless networking system; Bayesian methods; Data mining; Explosions; Frequency; Information systems; Laplace equations; Large-scale systems; Management information systems; Network servers; Power system modeling; Bayesian regression; dependency; failure detection; failure diagnosis; system management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.51
Filename :
4781106
Link To Document :
بازگشت