DocumentCode :
423961
Title :
A solving method for MDPs by minimizing variational free energy
Author :
Yoshimoto, Junichiro ; Ishii, Shin
Author_Institution :
CREST, Japan Sci. & Technol. Agency, Japan
Volume :
3
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
1817
Abstract :
We propose a novel approach to acquire the optimal policy for a continuous Markov decision process. Based on an analogy from statistical mechanics, we introduce a variational free energy over a policy. A good policy can be obtained by minimizing the variational free energy. According to our approach, the optimal policy in linear quadratic regulator problems can be obtained by using Kalman filtering and smoothing techniques. Even in non-linear problems, a semi-optimal policy can be obtained by Monte Carlo technique with a Gaussian process method.
Keywords :
Gaussian processes; Kalman filters; Markov processes; Monte Carlo methods; decision theory; linear quadratic control; minimisation; smoothing methods; statistical mechanics; Gaussian process method; Kalman filtering; Monte Carlo technique; continuous Markov decision process; linear quadratic regulator; nonlinear problems; semioptimal policy; smoothing techniques; statistical mechanics; variational free energy minimization; Cost function; Decision making; Filtering; Gradient methods; Kalman filters; Nonlinear filters; Probability distribution; Regulators; Smoothing methods; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1380884
Filename :
1380884
Link To Document :
بازگشت