DocumentCode
3191587
Title
Dual-JT: Toward the high availability of JobTracker in Hadoop
Author
Jian Wan ; Minggang Liu ; Xixiang Hu ; Zujie Ren ; Jilin Zhang ; Weisong Shi ; Wei Wu
Author_Institution
Sch. of Comput. Sci. & Technol., Hangzhou Dianzi Univ., Hangzhou, China
fYear
2012
fDate
3-6 Dec. 2012
Firstpage
263
Lastpage
268
Abstract
MapReduce is a state-of-the-art computation paradigm that is becoming widely used for processing large-scale datasets. Hadoop is an open-source implementation of MapReduce and follows a master-slave architecture. This architecture makes Hadoop suffer from a single point of failure in the JobTracker. In this paper, we design a solution that removes the JobTracker as a single point of failure and thereby enhances its availability. In this solution, a standby JobTracker is introduced to act as a hot backup node for the active JobTracker. The standby JobTracker synchronizes the job execution process with the active JobTracker by collecting and parsing the job log. If the active JobTracker fails, the standby JobTracker can take over quickly. This solution is implemented in Hadoop 0.20.x. Extensive experiments illustrate that this solution effectively enhances the availability of the JobTracker. A big production cluster in a large e-commerce company has adopted this solution, which avoids interrupting job submission and execution when the JobTracker fails or restarts.
Keywords
distributed processing; electronic commerce; public domain software; Dual-JT; Hadoop 0.20.x; JobTracker; MapReduce; computation paradigm; e-commerce company; hot backup node; job execution process; job log collection; job log parsing; large-scale dataset processing; master-slave architecture; open-source implementation; single point of failure; Availability; Computer architecture; Conferences; Delay; IP networks; Real-time systems; Synchronization; Hadoop; High Available; MapReduce; Single Point of Failure;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing Technology and Science (CloudCom), 2012 IEEE 4th International Conference on
Conference_Location
Taipei
Print_ISBN
978-1-4673-4511-8
Electronic_ISBN
978-1-4673-4509-5
Type
conf
DOI
10.1109/CloudCom.2012.6427485
Filename
6427485
Link To Document