DocumentCode :
2334115
Title :
Fast reinforcement learning using stochastic shortest paths for a mobile robot
Author :
Kwon, Wooyoung ; Suh, Il Hong ; Lee, Sanghoon ; Cho, Young-Jo
Author_Institution :
Electron. & Telecommun. Res. Inst., Daejeon
fYear :
2007
fDate :
Oct. 29 2007-Nov. 2 2007
Firstpage :
82
Lastpage :
87
Abstract :
Reinforcement learning (RL) has been used as a learning mechanism for a mobile robot to learn state-action relations without a priori knowledge of working environment. However, most RL methods usually suffer from slow convergence to learn optimum state-action sequence. In this paper, it is intended to improve a learning speed by compounding an existing Q-learning method with a shortest path finding algorithm. To integrate the shortest path algorithm with Q- learning method, a stochastic state-transition model is used to store a previous observed state, a previous action and a current state. Whenever a robot reaches a goal, a Stochastic Shortest Path(SSP) will be found from the stochastic state-transition model. State-action pairs on the SSP will be counted as more significant in the action selection. Using this learning method, the learning speed will be boosted when compared with classical RL methods. To show the validity of our proposed learning technology, several simulations and experimental results will be illustrated.
Keywords :
graph theory; learning (artificial intelligence); mobile robots; path planning; stochastic processes; Q-learning; mobile robot; optimum state-action sequence; reinforcement learning; state-transition model; stochastic shortest path finding; Convergence; Cost function; Intelligent robots; Learning systems; Machine learning; Mobile robots; Notice of Violation; Stochastic processes; Stochastic systems; USA Councils;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-0912-9
Electronic_ISBN :
978-1-4244-0912-9
Type :
conf
DOI :
10.1109/IROS.2007.4399040
Filename :
4399040
Link To Document :
بازگشت