مرکز منطقه ای اطلاع رساني علوم و فناوري - Credit of optimal state transition based reinforcement learning algorithm

DocumentCode :

406112

Title :

Credit of optimal state transition based reinforcement learning algorithm

Author :

Bai, Tingfeng ; Wu, Gengfeng

Author_Institution :

Sch. of Comput. Sci. & Eng., Shanghai Univ., China

Volume :

fYear :

2003

fDate :

14-17 Dec. 2003

Firstpage :

Abstract :

This paper proposed an optimal model based on the distance between current state and goal state and the cost of state transition in order to solve goal state problem more effectively. Based on the optimal model, a unique reinforcement learning algorithm named COSTRLA (credit of optimal state transition based reinforcement learning algorithm) is also presented. The COSTRLA defined a COST function used to evaluate optimality of output strategy, developed update principle for the COST function based on the dynamic programming principle, while reinforcement signal is defined as the distance from current state to goal state. The COSTRLA was applied into cooperative control of Buddy-Arnolds robot. The simulation experiment has shown the advantages of COSTRLA over some popular reinforcement learning algorithms such as Q-learning and prioritized sweeping algorithms.

Keywords :

cooperative systems; dynamic programming; learning (artificial intelligence); multi-robot systems; optimal systems; dynamic programming; optimal model; optimal state transition; prioritized sweeping algorithms; reinforcement learning algorithm; Computer science; Cost function; Dynamic programming; Learning; Predictive models; Robot control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks and Signal Processing, 2003. Proceedings of the 2003 International Conference on

Conference_Location :

Nanjing

Print_ISBN :

0-7803-7702-8

Type :

conf

DOI :

10.1109/ICNNSP.2003.1279213

Filename :

1279213

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=406112