DocumentCode
1126750
Title
Optimal remapping in dynamic bulk synchronous computations via a stochastic control approach
Author
Yin, G. George ; Xu, Cheng-Zhong ; Le Yi, Wang
Author_Institution
Dept. of Math., Wayne State Univ., Detroit, MI, USA
Volume
14
Issue
1
fYear
2003
fDate
1/1/2003 12:00:00 AM
Firstpage
51
Lastpage
62
Abstract
A bulk synchronous computation proceeds in phases that are separated by barrier synchronization. For dynamic bulk synchronous computations that exhibit varying phase-wise computational requirements, remapping at runtime is an effective approach to ensure parallel efficiency. The paper introduces a novel remapping strategy for computations whose workload changes can be modeled as a Markov chain. The use of a Markovian model allows us to treat statistical dependence and more complex structure than the usual independent identically distributed random variable assumptions. Our models are quite general and we do not need to impose conditions on the dynamics of the underlying process other than the transition probability matrix. It is shown that optimal remapping can be formulated as a binary decision process: remap or not at a given synchronizing instant. The optimal strategy is then developed for long lasting computations by employing optimal stopping rules in a stochastic control framework. The existence of optimal controls is established. Necessary and sufficient conditions for the optimality are obtained. Furthermore, a policy iteration algorithm is devised to reduce computational complexity and enhance fast convergence to the desired optimal control.
Keywords
Markov processes; computational complexity; parallel processing; processor scheduling; stochastic processes; synchronisation; Markov chain; Markovian model; barrier synchronization; binary decision process; bulk synchronous computation; computational complexity; dynamic bulk synchronous computations; independent identically distributed random variable assumptions; optimal controls; optimal remapping; optimal stopping rules; optimal strategy; parallel efficiency; policy iteration algorithm; remapping strategy; statistical dependence; stochastic control approach; synchronizing instant; transition probability matrix; varying phase-wise computational requirements; workload changes; Computational fluid dynamics; Computational modeling; Concurrent computing; Dynamic scheduling; Optimal control; Peer to peer computing; Probability; Random variables; Runtime; Stochastic processes;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2003.1167370
Filename
1167370
Link To Document