• DocumentCode
    3588854
  • Title

    System Failure Prediction through Rare-Events Elastic-Net Logistic Regression

  • Author

    Navarro, Jose M. ; Parada, G. Hugo A. ; Duenas, Juan C.

  • Author_Institution
    Center for Open Middleware, Univ. Politec. de Madrid, Madrid, Spain
  • fYear
    2014
  • Firstpage
    120
  • Lastpage
    125
  • Abstract
    Predicting failures in a distributed system based on previous events through logistic regression is a standard approach in literature. This technique is not reliable, though, in two situations: in the prediction of rare events, which do not appear in enough proportion for the algorithm to capture, and in environments where there are too many variables, as logistic regression tends to over fit on this situations, while manually selecting a subset of variables to create the model is error-prone. On this paper, we solve an industrial research case that presented this situation with a combination of elastic net logistic regression, a method that allows us to automatically select useful variables, a process of cross-validation on top of it and the application of a rare events prediction technique to reduce computation time. This process provides two layers of cross-validation that automatically obtain the optimal model complexity and the optimal model parameters values, while ensuring even rare events will be correctly predicted with a low amount of training instances. We tested this method against real industrial data, obtaining a total of 60 out of 80 possible models with a 90% average model accuracy.
  • Keywords
    computational complexity; distributed processing; failure analysis; feature selection; regression analysis; computation time; cross-validation; distributed system; elastic net logistic regression; industrial research case; optimal model complexity; optimal model parameters value; predicting failure; rare events prediction technique; rare-events elastic-net logistic regression; system failure prediction; Complexity theory; Computational modeling; Data models; Logistics; Prediction algorithms; Predictive models; Training; Automatic Feature Selection; Logistic Regression; Machine Learning; Multivariable Prediction; Online Failure Prediction; System Management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence, Modelling and Simulation (AIMS), 2014 2nd International Conference on
  • Print_ISBN
    978-1-4799-7599-0
  • Type

    conf

  • DOI
    10.1109/AIMS.2014.19
  • Filename
    7102446