DocumentCode :
1241311
Title :
Toward Automated Anomaly Identification in Large-Scale Systems
Author :
Lan, Zhiling ; Zheng, Ziming ; Li, Yawei
Author_Institution :
Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
Volume :
21
Issue :
2
fYear :
2010
Firstpage :
174
Lastpage :
187
Abstract :
When a system fails to function properly, health-related data are collected for troubleshooting. However, it is challenging to effectively identify anomalies from the voluminous amount of noisy, high-dimensional data. The traditional manual approach is time-consuming, error-prone, and even worse, not scalable. In this paper, we present an automated mechanism for node-level anomaly identification in large-scale systems. A set of techniques is presented to automatically analyze collected data: data transformation to construct a uniform data format for data analysis, feature extraction to reduce data size, and unsupervised learning to detect the nodes acting differently from others. Moreover, we compare two techniques, principal component analysis (PCA) and independent component analysis (ICA), for feature extraction. We evaluate our prototype implementation by injecting a variety of faults into a production system at NCSA. The results show that our mechanism, in particular, the one using ICA-based feature extraction, can effectively identify faulty nodes with high accuracy and low computation overhead.
Keywords :
feature extraction; independent component analysis; large-scale systems; principal component analysis; security of data; unsupervised learning; automated anomaly identification; data transformation technique; feature extraction technique; independent component analysis; large-scale systems; principal component analysis; unsupervised learning technique; Anomaly identification; independent component analysis; large-scale systems; outlier detection.; principal component analysis;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2009.52
Filename :
4815224
Link To Document :
بازگشت