• DocumentCode
    635068
  • Title

    Learning and information for dual control

  • Author

    Alpcan, Tansu ; Shames, Iman ; Cantoni, Marco ; Nair, Girish

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Univ. of Melbourne, Melbourne, VIC, Australia
  • fYear
    2013
  • fDate
    23-26 June 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In dual control problems, the aim is to concurrently learn and control an unknown system. However, actively learning the system conflicts directly with any given control objective as it involves disturbing the system for exploration. This paper presents a multi-objective approach to dual control, which explicitly quantifies both the learning and control objectives. Mutual information and relative entropy from information theory are used to quantify the information gain in active learning as part of the exploration process. The information gain is then balanced against a standard control objective. The presented approach is illustrated using Gaussian process regression, which provides a framework for learning nonlinear systems and is used as a demonstrative example. It is shown that the derived information measures are closely related to the variance of the predictive Gaussian distribution estimating the system.
  • Keywords
    Gaussian distribution; entropy; learning systems; nonlinear control systems; Gaussian process regression; active learning; control objectives; dual control problem; exploration process; information gain quantification; information measure; information theory; learning nonlinear system; learning objectives; multiobjective approach; mutual information; predictive Gaussian distribution; relative entropy; unknown system control; Control systems; Entropy; Gaussian distribution; Ground penetrating radar; Measurement; Mutual information; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (ASCC), 2013 9th Asian
  • Conference_Location
    Istanbul
  • Print_ISBN
    978-1-4673-5767-8
  • Type

    conf

  • DOI
    10.1109/ASCC.2013.6606212
  • Filename
    6606212