DocumentCode :
1791543
Title :
Optimizing load balancing and data-locality with data-aware scheduling
Author :
Ke Wang ; Xraobing Zhou ; Tonglin Li ; Dongfang Zhao ; Lang, Michael ; Raicu, Ioan
Author_Institution :
Illinois Inst. of Technol., Chicago, IL, USA
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
119
Lastpage :
128
Abstract :
Load balancing techniques (e.g. work stealing) are important to obtain the best performance for distributed task scheduling systems that have multiple schedulers making scheduling decisions. In work stealing, tasks are randomly migrated from heavy-loaded schedulers to idle ones. However, for data-intensive applications where tasks are dependent and task execution involves processing a large amount of data, migrating tasks blindly yields poor data-locality and incurs significant data-transferring overhead. This work improves work stealing by using both dedicated and shared queues. Tasks are organized in queues based on task data size and location. We implement our technique in MATRIX, a distributed task scheduler for many-task computing. We leverage distributed key-value store to organize and scale the task metadata, task dependency, and data-locality. We evaluate the improved work stealing technique with both applications and micro-benchmarks structured as direct acyclic graphs. Results show that the proposed data-aware work stealing technique performs well.
Keywords :
directed graphs; parallel processing; resource allocation; scheduling; MATRIX; data-aware scheduling; data-aware work stealing technique; data-locality; dedicated queues; direct acyclic graphs; distributed key-value store; distributed task scheduler; load balancing; many-task computing; shared queues; task dependency; task metadata; Computer architecture; Distributed databases; Load management; Processor scheduling; Scheduling; Servers; Throughput; data-aware scheduling; data-intensive computing; key-value stores; many-task computing; work stealing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004220
Filename :
7004220
Link To Document :
بازگشت