DocumentCode
685820
Title
An approach for log analysis based failure monitoring in Hadoop cluster
Author
Mohandas, Madhury ; Dhanya, P.M.
Author_Institution
Dept. of Comput. Sci. & Eng., Rajagiri Sch. of Eng. & Technol., Kochi, India
fYear
2013
fDate
12-14 Dec. 2013
Firstpage
861
Lastpage
867
Abstract
Massive and gargantuan amount of data is produced on per day basis. Such scenario elevates the need for apposite storage, supervision and processing of data. The massive use of Distributed framework calls for faster analysis and diagnosis of failures. Due to the distributed nature of processing, it is difficult for cluster administrator to isolate the failures and failed nodes. Many contributions have been done for failure monitoring, analysis etc in the last few years. Apache Hadoop´s Jobtracker, Namenode, Secondary Namenode, Datanode and Tasktracker all generate logs. This paper aims at building a failure monitoring system from the scratch, by parsing and analyzing the Hadoop log files generated in the cluster. The monitoring system gives all relevant details related to the application, and points out the specific reason for failure, that is, whether an application failure or a network failure (these are the most common failures in the cluster).
Keywords
Java; fault diagnosis; program diagnostics; public domain software; Apache Hadoop Jobtracker; Datanode; Hadoop cluster; Namenode; Secondary Namenode; Tasktracker; distributed framework; failure analysis; failure diagnosis; failure monitoring system; log analysis based failure monitoring; open source Java software framework; Computational modeling; Computer architecture; File systems; Google; History; Monitoring; BigData; Failure Monitoring; HDFS; Hadoop; Log Analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Green Computing, Communication and Conservation of Energy (ICGCE), 2013 International Conference on
Conference_Location
Chennai
Type
conf
DOI
10.1109/ICGCE.2013.6823555
Filename
6823555
Link To Document