• DocumentCode
    3326931
  • Title

    Active Bayesian feature weighting in reinforcement learning robot

  • Author

    Kaitwanidvilai, S. ; Parnichkun, M.

  • Author_Institution
    Sch. of Adv. Technol., Asian Inst. of Technol., Pathumthani, Thailand
  • Volume
    2
  • fYear
    2002
  • fDate
    11-14 Dec. 2002
  • Firstpage
    1090
  • Abstract
    A priori knowledge incorporation is known to be a bias for robot\´s exploration. This bias is intended to guide a robot to improve the quality of learning performance by selecting more significant sample in search space. However, main drawbacks of priori bias are that there is no guarantee that the final behavior is optimal and bias may be incorrect when the environment is changing. In this paper, we proposed additional guidance in a framework of active Bayesian network. Pre-defined features and expected utility function in our approach are used to determine the weighting factor of selecting action in Q-learning. We also use "evidence" from robot\´s experience which able to indicate that the current guidance knowledge (bias) is correct or not. This information is used to update parameters of Bayesian network by probabilistic adaptation algorithm. The posterior guidance knowledge can be taken into account based on this updating. Our approach is stated in general framework, which can be applied in any applications. A simple maze navigation problem is presented, using a Nomad200 mobile robot equipped with wireless video camera and frame grabber.
  • Keywords
    Bayes methods; learning (artificial intelligence); mobile robots; probability; Nomad200 mobile robot; Q-learning; a priori knowledge incorporation; active Bayesian feature weighting; active Bayesian network; frame grabber; learning performance quality improvement; maze navigation; mobile robot; parameter updating; posterior guidance knowledge; probabilistic adaptation algorithm; reinforcement learning robot; weighting factor; wireless video camera; Bayesian methods; Equations; Intelligent robots; Mobile robots; Orbital robotics; Robot sensing systems; Space technology; Supervised learning; Unsupervised learning; Utility theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Technology, 2002. IEEE ICIT '02. 2002 IEEE International Conference on
  • Print_ISBN
    0-7803-7657-9
  • Type

    conf

  • DOI
    10.1109/ICIT.2002.1189323
  • Filename
    1189323