• DocumentCode
    575536
  • Title

    Robust reinforcement learning technique with bigeminal representation of continuous state space for multi-robot systems

  • Author

    Yasuda, Toshiyuki ; Kage, Koki ; Ohkura, Kazuhiro

  • Author_Institution
    Fac. of Eng., Hiroshima Univ., Hiroshima, Japan
  • fYear
    2012
  • fDate
    20-23 Aug. 2012
  • Firstpage
    1552
  • Lastpage
    1557
  • Abstract
    We have been developing a reinforcement learning technique called Bayesian-discrimination-function-based reinforcement learning (BRL) as an approach to autonomous specialization, which is a new concept in cooperative multirobot systems. BRL has a mechanism for autonomously segmenting the continuous state and action space. However, as in other machine learning approaches, overfitting is occasionally observed after successful learning. This paper proposes a technique to sophisticatedly utilize messy knowledge acquired using BRL. The proposed technique that has a doubly represented state space by parametric and nonparametric models is expected to show better learning performance and robustness against environmental changes. We investigate the proposed technique by conducting computer simulations of a cooperative transport task.
  • Keywords
    belief networks; learning (artificial intelligence); multi-robot systems; BRL; Bayesian-discrimination-function-based reinforcement learning; bigeminal representation; continuous state space; cooperative multirobot systems; cooperative transport task; machine learning approaches; multi-robot systems; robust reinforcement learning technique; Learning; Mobile robots; Robot kinematics; Robot sensing systems; Robustness; Support vector machines; continuous state space; multi-robot system; nonparametric model; parametric model; reinforcement learning; robustness; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE Annual Conference (SICE), 2012 Proceedings of
  • Conference_Location
    Akita
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-2259-1
  • Type

    conf

  • Filename
    6318698