• Title of article

    Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection

  • Author/Authors

    Avila، نويسنده , , Luis and Martيnez، نويسنده , , Ernesto، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2014
  • Pages
    19
  • From page
    6327
  • To page
    6345
  • Abstract
    The increasing trend towards delegating tasks to autonomous artificial agents in safety–critical socio-technical systems makes monitoring an action selection policy of paramount importance. Agent behavior monitoring may profit from a stochastic specification of an optimal policy under uncertainty. A probabilistic monitoring approach is proposed to assess if an agent behavior (or policy) respects its specification. The desired policy is modeled by a prior distribution for state transitions in an optimally-controlled stochastic process. Bayesian surprise is defined as the Kullback–Leibler divergence between the state transition distribution for the observed behavior and the distribution for optimal action selection. To provide a sensitive on-line estimation of Bayesian surprise with small samples twin Gaussian processes are used. Timely detection of a deviant behavior or anomaly in an artificial pancreas highlights the sensitivity of Bayesian surprise to a meaningful discrepancy regarding the stochastic optimal policy when there exist excessive glycemic variability, sensor errors, controller ill-tuning and infusion pump malfunctioning. To reject outliers and leave out redundant information, on-line sparsification of data streams is proposed.
  • Keywords
    Bayesian surprise , Behavior monitoring , Kullback–Leibler divergence , artificial pancreas , Optimal action selection
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2014
  • Journal title
    Expert Systems with Applications
  • Record number

    2355083