DocumentCode
3588854
Title
System Failure Prediction through Rare-Events Elastic-Net Logistic Regression
Author
Navarro, Jose M. ; Parada, G. Hugo A. ; Duenas, Juan C.
Author_Institution
Center for Open Middleware, Univ. Politec. de Madrid, Madrid, Spain
fYear
2014
Firstpage
120
Lastpage
125
Abstract
Predicting failures in a distributed system based on previous events through logistic regression is a standard approach in literature. This technique is not reliable, though, in two situations: in the prediction of rare events, which do not appear in enough proportion for the algorithm to capture, and in environments where there are too many variables, as logistic regression tends to over fit on this situations, while manually selecting a subset of variables to create the model is error-prone. On this paper, we solve an industrial research case that presented this situation with a combination of elastic net logistic regression, a method that allows us to automatically select useful variables, a process of cross-validation on top of it and the application of a rare events prediction technique to reduce computation time. This process provides two layers of cross-validation that automatically obtain the optimal model complexity and the optimal model parameters values, while ensuring even rare events will be correctly predicted with a low amount of training instances. We tested this method against real industrial data, obtaining a total of 60 out of 80 possible models with a 90% average model accuracy.
Keywords
computational complexity; distributed processing; failure analysis; feature selection; regression analysis; computation time; cross-validation; distributed system; elastic net logistic regression; industrial research case; optimal model complexity; optimal model parameters value; predicting failure; rare events prediction technique; rare-events elastic-net logistic regression; system failure prediction; Complexity theory; Computational modeling; Data models; Logistics; Prediction algorithms; Predictive models; Training; Automatic Feature Selection; Logistic Regression; Machine Learning; Multivariable Prediction; Online Failure Prediction; System Management;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence, Modelling and Simulation (AIMS), 2014 2nd International Conference on
Print_ISBN
978-1-4799-7599-0
Type
conf
DOI
10.1109/AIMS.2014.19
Filename
7102446
Link To Document