مرکز منطقه ای اطلاع رساني علوم و فناوري - Application of reinforcement learning control to a nonlinear bouncing cart

DocumentCode :

343054

Title :

Application of reinforcement learning control to a nonlinear bouncing cart

Author :

Bucak, Ihsan Omur ; Zohdy, Mohamed A.

Author_Institution :

Sch. of Eng. & Comput. Sci., Oakland Univ., Rochester, MI, USA

Volume :

fYear :

1999

fDate :

2-4 June 1999

Firstpage :

1198

Abstract :

We consider a nonlinear bouncing cart motion, controlled by reinforcement learning (RL) control. The learning algorithm consists of Q-learning and advantage updating (AU) to keep the cart within desired limits. Q-learning is a RL algorithm that applies "delayed reinforcement" and performs optimal actions to maximize return values whereby the system performance is evaluated. RL is also extended through the use of AU in continuous-time. AU is another RL algorithm that stores both value function and advantage function, representing an estimate of the degree to which the expected total discounted reinforcement is increased by performing action other than the action currently considered to be best.

Keywords :

dynamic programming; learning (artificial intelligence); learning systems; motion control; nonlinear control systems; Q-learning; advantage function; advantage updating; delayed reinforcement; expected total discounted reinforcement; nonlinear bouncing cart; optimal actions; reinforcement learning control; value function; Application software; Computer science; Control systems; Electronic mail; Gold; Learning; Motion control; Nonlinear control systems; System performance; Systems engineering and theory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

American Control Conference, 1999. Proceedings of the 1999

Conference_Location :

San Diego, CA, USA

ISSN :

0743-1619

Print_ISBN :

0-7803-4990-3

Type :

conf

DOI :

10.1109/ACC.1999.783230

Filename :

783230

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=343054