Evolutionary computation versus reinforcement learning

Author

Schmidhuber, Jürgen

Author_Institution

IDSIA, Manno, Switzerland

Volume

4

fYear

2000

fDate

2000

Firstpage

2992

Abstract

Many applications of reinforcement learning (RL) and evolutionary computation (EC) are addressing the same problem, namely, to maximize some agent´s fitness function in a potentially unknown environment. The most challenging open issues in such applications include partial observability of the agent´s environment, hierarchical and other types of abstract credit assignment, and the learning of credit assignment algorithms. I summarize why EC provides a more natural framework for addressing these issues than RL based on value functions and dynamic programming. Then I point out fundamental drawbacks of traditional EC methods in case of stochastic environments, stochastic policies, and unknown temporal delays between actions and observable effects. I discuss a remedy called the success-story algorithm which combines aspects of RL and EC

Keywords

dynamic programming; evolutionary computation; learning (artificial intelligence); abstract credit assignment; agent fitness function; dynamic programming; evolutionary computation; partial observability; reinforcement learning; stochastic environments; stochastic policies; success-story algorithm; unknown environment; unknown temporal delays; value functions; Ambient intelligence; Evolutionary computation; Layout; Learning; Petroleum; Tiles;

fLanguage

English

Publisher

ieee

Conference_Titel

Industrial Electronics Society, 2000. IECON 2000. 26th Annual Confjerence of the IEEE

Conference_Location

Nagoya

Print_ISBN

0-7803-6456-2

Type

conf

DOI

10.1109/IECON.2000.972474

Filename

972474