• DocumentCode
    1576179
  • Title

    Intrinsically motivated neuroevolution for vision-based reinforcement learning

  • Author

    Cuccu, Giuseppe ; Luciw, Matthew ; Schmidhuber, Jürgen ; Gomez, Faustino

  • Author_Institution
    IDSIA, Univ. of Lugano, Manno-Lugano, Switzerland
  • Volume
    2
  • fYear
    2011
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Neuroevolution, the artificial evolution of neural networks, has shown great promise on continuous reinforcement learning tasks that require memory. However, it is not yet directly applicable to realistic embedded agents using high-dimensional (e.g. raw video images) inputs, requiring very large networks. In this paper, neuroevolution is combined with an unsupervised sensory pre-processor or compressor that is trained on images generated from the environment by the population of evolving recurrent neural network controllers. The compressor not only reduces the input cardinality of the controllers, but also biases the search toward novel controllers by rewarding those controllers that discover images that it reconstructs poorly. The method is successfully demonstrated on a vision-based version of the well-known mountain car benchmark, where controllers receive only single high-dimensional visual images of the environment, from a third-person perspective, instead of the standard two-dimensional state vector which includes information about velocity.
  • Keywords
    computer vision; learning (artificial intelligence); recurrent neural nets; artificial evolution; continuous reinforcement learning; embedded agents; high-dimensional visual images; intrinsically motivated neuroevolution; neural networks; recurrent neural network controllers; unsupervised sensory preprocessor; vision-based reinforcement learning; Tin; Yttrium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Development and Learning (ICDL), 2011 IEEE International Conference on
  • Conference_Location
    Frankfurt am Main
  • ISSN
    2161-9476
  • Print_ISBN
    978-1-61284-989-8
  • Type

    conf

  • DOI
    10.1109/DEVLRN.2011.6037324
  • Filename
    6037324