DocumentCode :
3268900
Title :
Optimization in Markov decision problems with transition-dependent cost functions
Author :
Wang, Junjie ; Xi-Ren Cao
Author_Institution :
Dept. of Electr. Eng., Maryland Univ., College Park, MD, USA
Volume :
2
fYear :
1999
fDate :
1999
Firstpage :
1478
Abstract :
The traditional MDP deals with the cost function which only depends on the state and the corresponding action. In the real world however, there are many applications where the cost incurred depends on the particular transition as well, which makes the traditional MDP solution infeasible for these problems. We apply the performance potential theory as an optimization tool for MDP. In particular the notion of the expanded Markov chain is introduced to map this problem to a general form. Both computation-based and sample-path-based algorithms are developed for potential derivation. We address ourselves to the complexity-reduction techniques. Finally, we apply these techniques to the “join the shortest queue” application, which is a significant component in the analysis of communication systems
Keywords :
Markov processes; computational complexity; decision theory; matrix algebra; optimisation; queueing theory; Markov decision problems; communication systems; complexity-reduction techniques; computation-based algorithms; expanded Markov chain; join the shortest queue application; optimization tool; performance potential theory; sample-path-based algorithms; transition-dependent cost functions; Communication networks; Communication system control; Control systems; Cost function; Dynamic programming; Educational institutions; Manufacturing systems; Queueing analysis; State-space methods; Yield estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Applications, 1999. Proceedings of the 1999 IEEE International Conference on
Conference_Location :
Kohala Coast, HI
Print_ISBN :
0-7803-5446-X
Type :
conf
DOI :
10.1109/CCA.1999.801190
Filename :
801190
Link To Document :
بازگشت