DocumentCode :
1916526
Title :
Resource Management for Dynamic MapReduce Clusters in Multicluster Systems
Author :
Ghit, Bogdan ; Yigitbasi, Nezih ; Epema, Dick
Author_Institution :
Delft Univ. of Technol., Delft, Netherlands
fYear :
2012
fDate :
10-16 Nov. 2012
Firstpage :
1252
Lastpage :
1259
Abstract :
State-of-the-art MapReduce frameworks such as Hadoop can easily scale up to thousands of machines and to large numbers of users. Nevertheless, some users may require isolated environments to develop their applications and to process their data, which calls for multiple deployments of MR clusters within the same physical infrastructure. In this paper, we design and implement a resource management system to facilitate the on-demand isolated deployment of MapReduce clusters in multicluster systems. Deploying multiple MapReduce clusters enables four types of isolation, with respect to performance, to data management, to fault tolerance, and to versioning. To efficiently manage the underlying physical resources, we propose three provisioning policies for dynamically resizing MapReduce clusters, and we evaluate the performance of our system through experiments on a real multicluster.
Keywords :
multiprocessing systems; parallel programming; resource allocation; Hadoop framework; data management; dynamic MapReduce cluster; fault tolerance; multicluster system; provisioning policy; resource management; versioning; MapReduce isolation; dynamic resource management; multicluster systems; performance evaluation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-6218-4
Type :
conf
DOI :
10.1109/SC.Companion.2012.151
Filename :
6495933
Link To Document :
بازگشت