DocumentCode
3712882
Title
An automatic approach to extract the formats of network and security log messages
Author
Jing Ya; Tingwen Liu; Haoliang Zhang; Jinqiao Shi;Li Guo
Author_Institution
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
fYear
2015
Firstpage
1542
Lastpage
1547
Abstract
Analyzing massive network and security logs that record network events is crucial for diagnosing network anomalies in large-scale network environments. Extracting log message formats is an important and necessary step to achieve the goal. However, it is time-consuming and costly to automatically and efficiently extract log message formats from massive network and security logs of many different types, which are generated by the increasing number of network and security devices and services used in large-scale networks. In this paper, we propose log template extraction (LTE), an approach that is semantics aware of network and security logs to address the problem. LTE first cleans log messages and then clusters the cleaned log messages based on the DBSCAN algorithm. At last it infers message templates by LDA Gibbs sampling algorithm. We evaluate our work on massive amount of network log messages collected from a large production network. Experimental results show that LTE approach infers and gets multiple log message formats at the same time with more than 90% accuracy and 100% recall.
Keywords
"Security","Ports (Computers)","Protocols","Clustering algorithms","IP networks","Reverse engineering","Data mining"
Publisher
ieee
Conference_Titel
Military Communications Conference, MILCOM 2015 - 2015 IEEE
Type
conf
DOI
10.1109/MILCOM.2015.7357664
Filename
7357664
Link To Document