Robot Learning in Partially Observable, Noisy, Continuous Worlds

Author

Broadbent, Reid ; Peterson, Todd

Author_Institution

Computer Science Department Brigham Young University Provo, Utah 84602 reid@byu.net

fYear

2005

fDate

18-22 April 2005

Firstpage

4386

Lastpage

4393

Abstract

Partially-observable Markov decision problems (POMDPs) pose special difficulties for the task of learning robot control policies, due to the need to disambiguate perceptually aliased states. Short-term memories of recent actions and/or percepts are required to provide context for the robot to perform such disambiguation. We introduce Variable-Resolution Percept Discretization (VRPD) as an extension to Utile Suffix Memory (USM), an algorithm designed to solve discrete POMDPs. This extension allows USM to function effectively in noisy, continuous worlds. We describe the extension in detail, then we demonstrate experimentally the improvements that it makes to USM in the context of continuous POMDPs.

Keywords

Algorithm design and analysis; Computer science; Educational institutions; Frequency; Humans; Observability; Orbital robotics; Robot control; Sonar; State-space methods;

fLanguage

English

Publisher

ieee

Conference_Titel

Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on

Print_ISBN

0-7803-8914-X

Type

conf

DOI

10.1109/ROBOT.2005.1570795

Filename

1570795

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2944630