Title :
Extraction of Error Detection Rules without Supervised Information from Log Files Using Automatically Defined Groups
Author :
KUROSAWA, Yoshiaki ; HARA, Akira ; Ichimura, Takumi ; KAWANO, Yuji
Author_Institution :
Hiroshima City Univ., Hiroshima
Abstract :
Our main aim is to extract multiple rules from log files in the computer systems, to detect various levels of errors, and to inform these errors or configuration mistakes to the system administrators automatically, in order to manage them without expert knowledge. To satisfy this aim, we performed an extraction experiment from the log files of a system using Automatically Defined Groups (ADG), which is based on Genetic Programming. Moreover, we focused on "System State Pattern" related to the difference between normal daily state and abnormal state that some errors occur in the system. In this experiment, then, we tried to extract rules without any manually managed and supervised information, by using simple translation technique: regular expressions. As a result, 50 agents in the best individual were divided into 16 groups from 322 log files. This means that 16 rules were acquired. We confirmed these rules could detect some errors such as DNS configuration error. We could also find the importance of the rules because the rule with more agents tended to have a higher adopted frequency by evolutionary computation. Therefore, we consider that our method using ADG is useful for the diagnosis of computer systems, and helps administrators manage their systems without expert knowledge about their systems.
Keywords :
data mining; genetic algorithms; learning (artificial intelligence); multi-agent systems; pattern clustering; ADG machine learning method; DNS configuration error; agent grouping; automatically defined groups; data clustering; data mining; error detection rule extraction; evolutionary computation; genetic programming; log file supervised information; regular expressions; system state pattern; translation technique; Computer errors; Computer networks; Cybernetics; Data mining; Databases; Genetic programming; Knowledge management; Local area networks; Operating systems; Protocols;
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
DOI :
10.1109/ICSMC.2006.385153