DocumentCode
2436159
Title
Towards identifying OS-level anomalies to detect application software failures
Author
Bovenzi, Antonio ; Russo, Stefano ; Brancati, Francesco ; Bondavalli, Andrea
Author_Institution
Dipt. di Inf. e Sist. (DIS), Univ. degli Studi di Napoli Federico II, Naples, Italy
fYear
2011
fDate
10-11 Oct. 2011
Firstpage
71
Lastpage
76
Abstract
The next generation of critical systems, namely complex Critical Infrastructures (LCCIs), require efficient runtime management, reconfiguration strategies, and the ability to take decisions on the basis of current and past behavior of the system. Anomaly-based detection, leveraging information gathered at Operating System (OS) level (e.g., number of system call errors, signals, and holding semaphores in the time unit), seems to be a promising approach to reveal online application faults. Recently an experimental campaign to evaluate the performance of two anomaly detection algorithms was performed on a case study from the Air Traffic Management (ATM) domain, deployed under the popular OS used in the production environment, i.e., Red Hat 5 EL. In this paper we investigate the impact of the OS and the monitored resources on the quality of the detection, by executing experiments on Windows Server 2008. Experimental results allow identifying which of the two operating systems provides monitoring facilities best suited to implement the anomaly detection algorithms that we have considered. Moreover numerical sensitivity analysis of the detector parameters is carried out to understand the impact of their setting on the performance.
Keywords
operating systems (computers); safety-critical software; sensitivity analysis; software performance evaluation; system monitoring; ATM domain; LCCI; OS level; OS-level anomaly; Windows Server 2008; air traffic management domain; anomaly detection algorithms; anomaly-based detection; complex critical infrastructures; critical systems; detector parameters; monitored resources; monitoring facility; next generation; online application faults; operating system level; operating systems; performance evaluation; production environment; reconfiguration strategy; runtime management; semaphores; sensitivity analysis; software failures; Accuracy; Detectors; Instruction sets; Linux; Measurement; Monitoring; Servers; OS-level monitoring; anomaly detection; software failure;
fLanguage
English
Publisher
ieee
Conference_Titel
Measurements and Networking Proceedings (M&N), 2011 IEEE International Workshop on
Conference_Location
Anacapri
Print_ISBN
978-1-4577-0455-0
Type
conf
DOI
10.1109/IWMN.2011.6088494
Filename
6088494
Link To Document