Title of article
Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection
Author/Authors
Avila، نويسنده , , Luis and Martيnez، نويسنده , , Ernesto، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2014
Pages
19
From page
6327
To page
6345
Abstract
The increasing trend towards delegating tasks to autonomous artificial agents in safety–critical socio-technical systems makes monitoring an action selection policy of paramount importance. Agent behavior monitoring may profit from a stochastic specification of an optimal policy under uncertainty. A probabilistic monitoring approach is proposed to assess if an agent behavior (or policy) respects its specification. The desired policy is modeled by a prior distribution for state transitions in an optimally-controlled stochastic process. Bayesian surprise is defined as the Kullback–Leibler divergence between the state transition distribution for the observed behavior and the distribution for optimal action selection. To provide a sensitive on-line estimation of Bayesian surprise with small samples twin Gaussian processes are used. Timely detection of a deviant behavior or anomaly in an artificial pancreas highlights the sensitivity of Bayesian surprise to a meaningful discrepancy regarding the stochastic optimal policy when there exist excessive glycemic variability, sensor errors, controller ill-tuning and infusion pump malfunctioning. To reject outliers and leave out redundant information, on-line sparsification of data streams is proposed.
Keywords
Bayesian surprise , Behavior monitoring , Kullback–Leibler divergence , artificial pancreas , Optimal action selection
Journal title
Expert Systems with Applications
Serial Year
2014
Journal title
Expert Systems with Applications
Record number
2355083
Link To Document