On amount and quality of bias in reinforcement learning

Author

Hailu, G. ; Sommer, G.

Author_Institution

Dept. of Cognitive Syst., Kiel Univ., Germany

Volume

2

fYear

1999

fDate

6/21/1905 12:00:00 AM

Firstpage

728

Abstract

Reinforcement learning is widely regarded as elegant in theory but hopelessly slow in practice. This is because it is often studied under the assumption that there is little or no prior information about the task at hand. This assumption, however, is not the defining characteristic of learning. Learning involves the incorporation of prior knowledge or bias that can greatly accelerate or otherwise improves the learning process. We address the influence of the amount and quality of bias on the speed of reinforcement learning. For a chosen class of learning problem different forms of biases are initially identified. Some of the biases are extracted from the knowledge of the environment, others from the task, and yet a few from both. Belief matrices, which reset Q-tables before learning commences, encode the biases. The average number of interactions between the agent and the environment is used to quantify the biases. Based on this performance measure, the biases are graded and some new results are reported. In addition, the paper compares continual learning to learning from scratch and presents results that clearly demonstrate the advantages of the former

Keywords

learning (artificial intelligence); matrix algebra; Q-tables; belief matrices; bias; continual learning; learning from scratch; learning speed; reinforcement learning; Acceleration; Humans; Learning systems; Machine learning; Quantization; Robots; State-space methods; Table lookup;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on

Conference_Location

Tokyo

ISSN

1062-922X

Print_ISBN

0-7803-5731-0

Type

conf

DOI

10.1109/ICSMC.1999.825352

Filename

825352