DocumentCode :
3193445
Title :
LoadAtomizer: A locality and I/O load aware task scheduler for MapReduce
Author :
Asahara, Masato ; Nakadai, S. ; Araki, Takeshi
Author_Institution :
Cloud Syst. Res. Labs., NEC Corp., Kawasaki, Japan
fYear :
2012
fDate :
3-6 Dec. 2012
Firstpage :
317
Lastpage :
324
Abstract :
Data-intensive computing systems like MapReduce and Dryad have emerged as a framework for leveraging computing resources of a cluster. I/O bottlenecks need to be eased to improve performance in data-intensive computing systems. State-of-the-art frameworks for data-intensive computing have tackled the issue with a data locality based task scheduling policy. However, locality-aware scheduling does not always work good to mitigate I/O bottlenecks when different I/O characteristic jobs run concurrently. This paper presents LoadAtomizer, a locality and I/O load aware task scheduler for MapReduce. LoadAtomizer mitigates the I/O bottlenecks of a cluster with locality and I/O load aware map task assignment and storage selection. LoadAtomizer quickly assigns a slave a map task whose input data is stored in a lightly loaded storage and commands the slave to read the input data from the storage. LoadAtomizer maintains the load information of storages and the network with a topology-aware load tree. A topology-aware load tree enables LoadAtomizer to select quickly a lightly loaded storage that a slave can access through a lightly loaded network path. Experimental results demonstrated that our prototype of LoadAtomizer shortened completion time of multiple jobs by up to 18.6 %.
Keywords :
concurrency control; input-output programs; parallel programming; resource allocation; scheduling; software performance evaluation; storage management; task analysis; tree data structures; I/O bottleneck mitigation; I/O characteristic jobs; I/O load aware map task assignment; I/O load aware task scheduler; LoadAtomizer; MapReduce; cluster computing resources; data locality-based task scheduling policy; data-intensive computing systems; lightly loaded network path; lightly loaded storage; locality aware task scheduler; performance improvement; storage selection; topology-aware load tree; Bandwidth; Cloud computing; Conferences; Processor scheduling; Scheduling; Switches; Throughput; I/O Control; Maplceduce; Task Scheduling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing Technology and Science (CloudCom), 2012 IEEE 4th International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4673-4511-8
Electronic_ISBN :
978-1-4673-4509-5
Type :
conf
DOI :
10.1109/CloudCom.2012.6427572
Filename :
6427572
Link To Document :
بازگشت