DocumentCode
2944630
Title
Robot Learning in Partially Observable, Noisy, Continuous Worlds
Author
Broadbent, Reid ; Peterson, Todd
Author_Institution
Computer Science Department Brigham Young University Provo, Utah 84602 reid@byu.net
fYear
2005
fDate
18-22 April 2005
Firstpage
4386
Lastpage
4393
Abstract
Partially-observable Markov decision problems (POMDPs) pose special difficulties for the task of learning robot control policies, due to the need to disambiguate perceptually aliased states. Short-term memories of recent actions and/or percepts are required to provide context for the robot to perform such disambiguation. We introduce Variable-Resolution Percept Discretization (VRPD) as an extension to Utile Suffix Memory (USM), an algorithm designed to solve discrete POMDPs. This extension allows USM to function effectively in noisy, continuous worlds. We describe the extension in detail, then we demonstrate experimentally the improvements that it makes to USM in the context of continuous POMDPs.
Keywords
Algorithm design and analysis; Computer science; Educational institutions; Frequency; Humans; Observability; Orbital robotics; Robot control; Sonar; State-space methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on
Print_ISBN
0-7803-8914-X
Type
conf
DOI
10.1109/ROBOT.2005.1570795
Filename
1570795
Link To Document