Title of article :
Markov decision processes with delays and asynchronous cost collection
Author/Authors :
K.V.، Katsikopoulos, نويسنده , , S.E.، Engelbrecht, نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2003
Pages :
7
From page :
568
To page :
574
Abstract :
Markov decision processes (MDPs) may involve three types of delays. First, state information, rather than being available instantaneously, may arrive with a delay (observation delay). Second, an action may take effect at a later decision stage rather than immediately (action delay). Third, the cost induced by an action may be collected after a number of stages (cost delay). We de rive two results, one for constant and one for random delays, for reducing an MDP with delays to an MDP without delays, which differs only in the size of the state space. The results are based on the intuition that costs may be collected asynchronously, i.e., at a stage other than the one in which they are induced, as long as they are discounted properly.
Keywords :
natural convection , heat transfer , Analytical and numerical techniques
Journal title :
IEEE Transactions on Automatic Control
Serial Year :
2003
Journal title :
IEEE Transactions on Automatic Control
Record number :
97470
Link To Document :
بازگشت