DocumentCode
573292
Title
An Anomaly Detection Algorithm Based on Lossless Compression
Author
Wang, Nan ; Han, Jizhong ; Fang, Jinyun
Author_Institution
Inst. of Comput. Technol., Beijing, China
fYear
2012
fDate
28-30 June 2012
Firstpage
31
Lastpage
38
Abstract
Anomaly detection is essential in network security. It has been researched for decades. Many anomaly detection methods have been proposed. Because of the simplicity of principles, statistical and Markovian methods dominate these approaches. However, their effectiveness is constrained by specific preconditions, which make them work for only appropriate data sets which satisfy their premises. Other than statistical and Markovian model, information theory provides a different perspective about anomaly detection. However, the computation of information theoretic measures is still based on statistics. In this paper, we present a novel, information theoretic anomaly detection framework. Instead of statistics, it employs lossless compression for measuring the information quantity, and detects outliers according to compression result. We also discuss the selection of underlying compression algorithm, and choose a grammar compression for utilizing the structure of data. With grammar compression, our method overcomes the shortcomings of statistical and Markovian methods. In addition, the implementation and operation of our method is even simpler than traditional approaches. We test our method on four data sets about text analyzing, host intrusion detection and bug detection. Experimental results show that, even traditional methods fail in some situations, our simple method works well in all cases.
Keywords
Markov processes; information theory; security of data; telecommunication security; Markovian method; Markovian model; anomaly detection algorithm; anomaly detection method; bug detection; compression algorithm; data structure; grammar compression; information quantity; information theoretic anomaly detection framework; information theoretic measures; information theory; intrusion detection; lossless compression; network security; Compression algorithms; Entropy; Grammar; Hidden Markov models; Markov processes; Statistical analysis; Training; anomaly detection; data mining; grammar-based compression;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking, Architecture and Storage (NAS), 2012 IEEE 7th International Conference on
Conference_Location
Xiamen, Fujian
Print_ISBN
978-1-4673-1889-1
Type
conf
DOI
10.1109/NAS.2012.8
Filename
6310873
Link To Document