• DocumentCode
    2454190
  • Title

    Ensembles of Neural Networks for Robust Reinforcement Learning

  • Author

    Hans, Alexander ; Udluft, Steffen

  • Author_Institution
    Neuroinformatics & Cognitive Robot. Lab., Ilmenau Univ. of Technol., Ilmenau, Germany
  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    401
  • Lastpage
    406
  • Abstract
    Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their training and the validation of final policies can be cumbersome as neural networks can suffer from problems like local minima or over fitting. When using iterative methods, such as neural fitted Q-iteration, the problem becomes even more pronounced since the network has to be trained multiple times and the training process in one iteration builds on the network trained in the previous iteration. Therefore errors can accumulate. In this paper we propose to use ensembles of networks to make the learning process more robust and produce near-optimal policies more reliably. We name various ways of combining single networks to an ensemble that results in a final ensemble policy and show the potential of the approach using a benchmark application. Our experiments indicate that majority voting is superior to Q-averaging and using heterogeneous ensembles (different network topologies) is advisable.
  • Keywords
    function approximation; iterative methods; learning (artificial intelligence); neural nets; Q averaging; benchmark application; function approximator; iterative method; near optimal policy; neural network; reinforcement learning; training process; Approximation algorithms; Artificial neural networks; Function approximation; Network topology; Neurons; Training; ensemble methods; neural fitted Q-iteration; neural networks; reinforcement learning; robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2010.66
  • Filename
    5708863