DocumentCode :
2535442
Title :
Optimal Task Reallocation in Heterogeneous Distributed Computing Systems with Age-Dependent Delay Statistics
Author :
Pezoa, Jorge E. ; Hayat, Majeed M. ; Wang, Zhuoyao ; Dhakal, Sagar
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of New Mexico, Albuquerque, NM, USA
fYear :
2010
fDate :
13-16 Sept. 2010
Firstpage :
111
Lastpage :
120
Abstract :
This paper presents a general framework for optimal task reallocation in heterogeneous distributed-computing systems and offers a rigorous analytical model for the stochastic execution time of a workload. The model takes into account the heterogeneity and stochastic nature of the tasks' service and transfer times, servers' failure times, as well as an arbitrary task-reallocation policy. The stochastic service, transfer and failure times are assumed to have general, age-dependent (non-exponential) distributions, resulting in a tandem distributed queuing system with non-Markovian dynamics. Auxiliary age variables are introduced in the analysis to capture the memory associated with the non-Markovian stochastic times, thereby enabling a regenerative age-dependent analytical characterization of the statistics of the execution time of a workload. The model is utilized to devise task reallocation policies that optimize three metrics: the average execution time of a workload, the quality-of-service in executing a workload by a prescribed deadline and the reliability in executing a workload. Implications of the non-exponential event times on these metrics are also studied. Key results are verified experimentally on a distributed-computing testbed.
Keywords :
Markov processes; delays; distributed processing; quality of service; queueing theory; resource allocation; task analysis; age-dependent delay statistics; heterogeneous distributed computing systems; load balancing; non-Markovian dynamics; optimal task reallocation; quality-of-service; stochastic service; tandem distributed queuing system; task-reallocation policy; Delay; Quality of service; Random variables; Reliability; Servers; Stochastic processes; communication delays; distributed computing; load balancing; non-Markovian queues; regeneration time; task reallocation
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel Processing (ICPP), 2010 39th International Conference on
Conference_Location :
San Diego, CA
ISSN :
0190-3918
Print_ISBN :
978-1-4244-7913-9
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2010.20
Filename :
5599155