Title :
CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase
Author :
Zhang, Chen ; De Sterck, Hans
Author_Institution :
David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
Date :
Nov. 30 2010-Dec. 3 2010
Abstract :
As MapReduce becomes more and more popular in data processing applications, the demand for Hadoop clusters grows. However, Hadoop is incompatible with existing cluster batch job queuing systems and requires a dedicated cluster under its full control. Hadoop also lacks support for user access control, accounting, fine-grained performance monitoring, and legacy batch job processing facilities comparable to those of existing cluster job queuing systems, making dedicated Hadoop clusters less attractive to administrators and ordinary users with hybrid computing needs involving both MapReduce and legacy applications. As a result, obtaining a suitably configured and sized Hadoop cluster has not been easy in organizations with existing clusters. This paper presents CloudBATCH, a prototype solution to this problem that enables Hadoop to function as a traditional batch job queuing system with enhanced functionality for cluster resource management. With CloudBATCH, a complete shift to Hadoop for managing an entire cluster to cater for hybrid computing needs becomes feasible.
Keywords :
batch processing (computers); cloud computing; pattern clustering; queueing theory; resource allocation; software prototyping; CloudBATCH; HBase; Hadoop cluster; MapReduce; batch job queuing system; cluster resource management; data processing; hybrid computing; prototype solution; Access control; Bioinformatics; Cloud computing; Computer architecture; Data processing; Monitoring; Resource management; Cloud; Hadoop; batch job queuing;
Conference_Title :
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-9405-7
Electronic_ISBN :
978-0-7695-4302-4
DOI :
10.1109/CloudCom.2010.22