DocumentCode
716652
Title
HCMDP: A hierarchical solution to Constrained Markov Decision Processes
Author
Feyzabadi, Seyedshams ; Carpin, Stefano
Author_Institution
Sch. of Eng., Univ. of California, Merced, Merced, CA, USA
fYear
2015
fDate
26-30 May 2015
Firstpage
3971
Lastpage
3978
Abstract
Constrained Markov Decision Processes (CMDPs) offer a principled way to tackle sequential decision problems with multiple objectives. Although they could be very valuable in numerous robotic applications, to date their use has been quite limited. One reason is that solving them requires solving constrained linear programs with a large number of variables, which is computationally demanding, especially in dynamic environments. In this paper we propose a hierarchical approach to solving large CMDPs. States are clustered into macro states, and relevant parameters such as transition probabilities and costs are extracted with a Monte Carlo approach. Macro states are created with the objective of grouping together states with similar costs while preserving feasibility. We illustrate the value of our findings in a path planning scenario where the robot moves through an environment characterized by different risk levels. Our approach largely outperforms the non-hierarchical method, and we also show how it prevails over methods based on fixed partitioning strategies.
Keywords
Markov processes; Monte Carlo methods; decision theory; linear programming; path planning; robots; HCMDP; Monte Carlo approach; constrained linear programming; dynamic environments; fixed partitioning strategy; hierarchical solution to constrained Markov decision processes; macro states; nonhierarchical method; sequential decision problems; transition probability; Clustering algorithms; Computational modeling; Merging
fLanguage
English
Publisher
IEEE
Conference_Titel
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location
Seattle, WA
Type
conf
DOI
10.1109/ICRA.2015.7139754
Filename
7139754