Title :
Estimating p-Values for Deviation Detection
Author :
Rognvaldsson, Thorsteinn ; Norrman, Henrik ; Byttner, Stefan ; Jarpe, Eric
Author_Institution :
Center for Appl. Intell. Syst. Res. (CAISR), Halmstad Univ., Halmstad, Sweden
Abstract :
Deviation detection is important for self-monitoring systems. To perform deviation detection well requires methods that, given only "normal" data from a distribution of unknown parametric form, can produce a reliable statistic for rejecting the null hypothesis, i.e. evidence for devating data. One measure of the strength of this evidence based on the data is the p-value, but few deviation detection methods utilize p-value estimation. We compare three methods that can be used to produce p-values: one class support vector machine (OCSVM), conformal anomaly detection (CAD), and a simple "most central pattern" (MCP) algorithm. The SVM and the CAD method should be able to handle a distribution of any shape. The methods are evaluated on synthetic data sets to test and illustrate their strengths and weaknesses, and on data from a real life self-monitoring scenario with a city bus fleet in normal traffic. The OCSVM has a Gaussian kernel for the synthetic data and a Hellinger kernel for the empirical data. The MCP method uses the Mahalanobis metric for the synthetic data and the Hellinger metric for the empirical data. The CAD uses the same metrics as the MCP method and has a k-nearest neighbour (kNN) non-conformity measure for both sets. The conclusion is that all three methods give reasonable, and quite similar, results on the real life data set but that they have clear strengths and weaknesses on the synthetic data sets. The MCP algorithm is quick and accurate when the "normal" data distribution is unimodal and symmetric (with the chosen metric) but not otherwise. The OCSVM is a bit cumbersome to use to create (quantized) p-values but is accurate and reliable when the data distribution is multimodal and asymmetric. The CAD is also accurate for multimodal and asymmetric distributions. The experiment on the vehicle data illustrate how algorithms like these can be used in a self-monitoring system that uses a fleet of vehicles to conduct deviation detection without supervisi- n and without prior knowledge about what is being monitored.
Keywords :
fault tolerant computing; learning (artificial intelligence); pattern classification; software reliability; support vector machines; system monitoring; CAD method; Gaussian kernel; Hellinger kernel; Hellinger metric; MCP algorithm; MCP method; Mahalanobis metric; OCSVM; asymmetric distribution; city bus fleet; conformal anomaly detection; deviation detection method; k-nearest neighbour nonconformity measure; kNN nonconformity measure; most central pattern algorithm; multimodal distribution; normal data distribution; normal traffic; null hypothesis; one class support vector machine; p-values estimation; quantized p-values; real life self-monitoring scenario; self-monitoring system; shape distribution; synthetic data set; vehicle data; Conferences; Design automation; Histograms; Kernel; Measurement; Training; Vehicles;
Conference_Titel :
Self-Adaptive and Self-Organizing Systems (SASO), 2014 IEEE Eighth International Conference on
Conference_Location :
London
DOI :
10.1109/SASO.2014.22