• DocumentCode
    2418788
  • Title

    Robotic gaze control using reinforcement learning

  • Author

    Rothbucher, Martin ; Denk, Christian ; Diepold, Klaus

  • Author_Institution
    Inst. for Data Process., Tech. Univ. Munchen, Munich, Germany
  • fYear
    2012
  • fDate
    8-9 Oct. 2012
  • Firstpage
    83
  • Lastpage
    88
  • Abstract
    This work examines how adaptive control can learn to point a camera at the active speaker in a conversation by using a Reinforcement Learning approach with audio and video data. A motivating scenario for this problem is a robotic platform that interacts with people around its environment. Using Reinforcement Learning, the task is specified with an observable objective referred to as the reward signal. Specifying this task with a reward signal enables an adaptive controller to improve its performance with experience. The reward for this task is generated by a visual feedback from the conversation participants that is detected by the robot´s camera system. Multiple experiments have been performed on a robot system with audiovisual data to examine the feasibility and potential of this approach. Our experimental results demonstrate that the system learns very fast to identify the active speakers. Furthermore, our approach inherently learns how to deal with egonoise that originates from the robot´s motor or background noise from the environment.
  • Keywords
    adaptive control; audio signal processing; cameras; feedback; human-robot interaction; intelligent robots; learning (artificial intelligence); robot vision; speaker recognition; active speaker identification; adaptive controller; audiovisual data; camera pointing; egonoise; reinforcement learning; reward signal; robotic gaze control; visual feedback; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Haptic Audio Visual Environments and Games (HAVE), 2012 IEEE International Workshop on
  • Conference_Location
    Munich
  • Print_ISBN
    978-1-4673-1568-5
  • Type

    conf

  • DOI
    10.1109/HAVE.2012.6374444
  • Filename
    6374444