Regional Information Center for Science and Technology (RICEST) - Tradeoff between exploration and exploitation of OQ(λ) with non-Markovian update in dynamic environments

DocumentCode :

2961040

Title :

Tradeoff between exploration and exploitation of OQ(λ) with non-Markovian update in dynamic environments

Author :

Shokri, Maryam ; Tizhoosh, Hamid R. ; Kamel, Mohamed S.

Author_Institution :

Dept. of Syst. Design Eng., Univ. of Waterloo, Waterloo, ON

fYear :

2008

fDate :

1-8 June 2008

Firstpage :

2915

Lastpage :

2921

Abstract :

This paper investigates the tradeoff between exploration and exploitation in opposition-based Q(λ) with non-Markovian update (NOQ(λ)) in a dynamic environment. In previous work, the authors applied NOQ(λ) to the deterministic GridWorld problem. Here, the NOQ(λ) algorithm is implemented for a simple elevator control problem to test its behavior in a non-deterministic, dynamic environment. The algorithm is also extended with an opposition weight to find a better tradeoff between exploration and exploitation. The value of the opposition weight increases with the number of steps, so it has a more positive effect on the Q-value updates for opposite actions as learning progresses. The performance of NOQ(λ) is compared with that of Q(λ). The experiments indicate that NOQ(λ) performs better than Q(λ).
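The abstract gives no formulas, so the following is only a rough sketch of how an opposition weight that grows with the step count might scale the Q-value update for an opposite action. Everything in it is an assumption made for illustration and not the authors' algorithm: the linear schedule in `opposition_weight`, the mirrored `opposite_action` map, the use of the negated reward for the opposite action, the reuse of the observed next state, and the omission of eligibility traces (a one-step update stands in for Q(λ)).

```python
from collections import defaultdict


def opposition_weight(step, max_steps):
    """Hypothetical schedule: grows linearly from 0 to 1 with the step count."""
    return min(1.0, step / max_steps)


def opposite_action(action, n_actions):
    """Hypothetical opposite map for an ordered action set (e.g. up vs. down)."""
    return n_actions - 1 - action


def update(Q, s, a, r, s_next, step, max_steps, n_actions, alpha=0.1, gamma=0.9):
    """One-step Q update plus a weighted update for the opposite action."""
    best_next = max(Q[(s_next, b)] for b in range(n_actions))

    # Standard one-step Q-learning update for the action actually taken.
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

    # Assumed opposite update: negated reward, same observed next state,
    # scaled by the opposition weight so it matters more late in learning.
    w = opposition_weight(step, max_steps)
    a_opp = opposite_action(a, n_actions)
    if a_opp != a:
        Q[(s, a_opp)] += w * alpha * (-r + gamma * best_next - Q[(s, a_opp)])
    return Q


Q_early = update(defaultdict(float), 0, 0, 1.0, 1, step=0, max_steps=100, n_actions=2)
Q_mid = update(defaultdict(float), 0, 0, 1.0, 1, step=50, max_steps=100, n_actions=2)
```

With this schedule the opposite action's value is left untouched at step 0 and receives half of the learning rate at the midpoint, which mirrors the abstract's statement that the weight has more influence as learning progresses.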

Keywords :

learning (artificial intelligence); lifts; deterministic GridWorld problem; dynamic environments; nonMarkovian update; simple elevator control problem; Bridges; Design engineering; Dynamic programming; Elevators; Feedback; Intelligent systems; Learning; Monte Carlo methods; Systems engineering and theory; Testing;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on

Conference_Location :

Hong Kong

ISSN :

1098-7576

Print_ISBN :

978-1-4244-1820-6

Electronic_ISBN :

1098-7576

Type :

conf

DOI :

10.1109/IJCNN.2008.4634208

Filename :

4634208

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2961040