مرکز منطقه ای اطلاع رساني علوم و فناوري - Higher Level Application of ADP: A Next Phase for the Control Field?

DocumentCode :

776326

Title :

Higher Level Application of ADP: A Next Phase for the Control Field?

Author :

Lendaris, George G.

Author_Institution :

Dept. of Electr. & Comput. Eng., Portland State Univ., Portland, OR

Volume :

Issue :

fYear :

2008

Firstpage :

901

Lastpage :

912

Abstract :

Two distinguishing features of humanlike control vis-a-vis current technological control are the ability to make use of experience while selecting a control policy for distinct situations and the ability to do so faster and faster as more experience is gained (in contrast to current technological implementations that slow down as more knowledge is stored). The notions of context and context discernment are important to understanding this human ability. Whereas methods known as adaptive control and learning control focus on modifying the design of a controller as changes in context occur, experience-based (EB) control entails selecting a previously designed controller that is appropriate to the current situation. Developing the EB approach entails a shift of the technologist´s focus ldquoup a levelrdquo away from designing individual (optimal) controllers to that of developing online algorithms that efficiently and effectively select designs from a repository of existing controller solutions. A key component of the notions presented here is that of higher level learning algorithm. This is a new application of reinforcement learning and, in particular, approximate dynamic programming, with its focus shifted to the posited higher level, and is employed, with very promising results. The author´s hope for this paper is to inspire and guide future work in this promising area.

Keywords :

adaptive control; dynamic programming; learning (artificial intelligence); learning systems; adaptive control; context discernment; dynamic programming; higher level application; humanlike control; individual controllers; learning control; reinforcement learning; Approximate dynamic programming (ADP); artificial intelligence (AI); context; context discernment; experience-based identification and control (EBIC); neural networks (NNs); optimal control; reinforcement learning (RL); system identification (SID); Artificial Intelligence; Biomimetics; Feedback; Humans; Learning; Programming, Linear; Systems Theory;

fLanguage :

English

Journal_Title :

Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on

Publisher :

ieee

ISSN :

1083-4419

Type :

jour

DOI :

10.1109/TSMCB.2008.918073

Filename :

4554032

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=776326