Title :
Pseudo-MDPs and factored linear action models
Author :
Hengshuai Yao ; Szepesvari, Csaba ; Pires, Bernardo Avila ; Xinhua Zhang
Author_Institution :
Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB, Canada
Abstract :
In this paper we introduce the concept of pseudo-MDPs to develop abstractions. Pseudo-MDPs relax the requirement that the transition kernel has to be a probability kernel. We show that the new framework captures many existing abstractions. We also introduce the concept of factored linear action models; a special case. Again, the relation of factored linear action models and existing works are discussed. We use the general framework to develop a theory for bounding the suboptimality of policies derived from pseudo-MDPs. Specializing the framework, we recover existing results. We give a leastsquares approach and a constrained optimization approach of learning the factored linear model as well as efficient computation methods. We demonstrate that the constrained optimization approach gives better performance than the least-squares approach with normalization.
Keywords :
Markov processes; constraint handling; least squares approximations; probability; computation method; constrained optimization approach; factored linear action model; factored linear model; least-squares approach; leastsquares approach; normalization; probability kernel; pseudo-MDP; suboptimality; transition kernel; Approximation methods; Computational modeling; Equations; Feature extraction; Kernel; Mathematical model; Optimization;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010633