• DocumentCode
    1662391
  • Title

    On amount and quality of bias in reinforcement learning

  • Author

    Hailu, G. ; Sommer, G.

  • Author_Institution
    Dept. of Cognitive Syst., Kiel Univ., Germany
  • Volume
    2
  • fYear
    1999
  • fDate
    6/21/1905 12:00:00 AM
  • Firstpage
    728
  • Abstract
    Reinforcement learning is widely regarded as elegant in theory but hopelessly slow in practice. This is because it is often studied under the assumption that there is little or no prior information about the task at hand. This assumption, however, is not the defining characteristic of learning. Learning involves the incorporation of prior knowledge or bias that can greatly accelerate or otherwise improves the learning process. We address the influence of the amount and quality of bias on the speed of reinforcement learning. For a chosen class of learning problem different forms of biases are initially identified. Some of the biases are extracted from the knowledge of the environment, others from the task, and yet a few from both. Belief matrices, which reset Q-tables before learning commences, encode the biases. The average number of interactions between the agent and the environment is used to quantify the biases. Based on this performance measure, the biases are graded and some new results are reported. In addition, the paper compares continual learning to learning from scratch and presents results that clearly demonstrate the advantages of the former
  • Keywords
    learning (artificial intelligence); matrix algebra; Q-tables; belief matrices; bias; continual learning; learning from scratch; learning speed; reinforcement learning; Acceleration; Humans; Learning systems; Machine learning; Quantization; Robots; State-space methods; Table lookup;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-5731-0
  • Type

    conf

  • DOI
    10.1109/ICSMC.1999.825352
  • Filename
    825352