Reinforcement learning of dialogue strategies with hierarchical abstract machines

Author

Cuayahuitl, H. ; Renals, S. ; Lemon, O. ; Shimodaira, H.

Author_Institution

CSTR, Univ. of Edinburgh, Edinburgh

fYear

2006

fDate

10-13 Dec. 2006

Firstpage

182

Lastpage

185

Abstract

In this paper we propose partially specified dialogue strategies for dialogue strategy optimization, where part of the strategy is specified deterministically and the rest optimized with reinforcement learning (RL). To do this we apply RL with hierarchical abstract machines (HAMs). We also propose to build simulated users using HAMs, incorporating a combination of hierarchical deterministic and probabilistic behaviour. We performed experiments using a single-goal flight booking dialogue system, and compare two dialogue strategies (deterministic and optimized) using three types of simulated user (novice, experienced and expert). Our results show that HAMs are promising for both dialogue optimization and simulation, and provide evidence that indeed partially specified dialogue strategies can outperform deterministic ones (on average 4.7 fewer system turns) with faster learning than the traditional RL framework.

Keywords

interactive systems; learning (artificial intelligence); optimisation; probability; speech processing; HAM; dialogue strategy optimization; hierarchical abstract machines; hierarchical deterministic behaviour; hierarchical probabilistic behaviour; reinforcement learning; single-goal flight booking dialogue system; Aerospace simulation; Automata; Automatic control; Automatic speech recognition; Collaboration; Control systems; Databases; Informatics; Learning; Optimization methods;

fLanguage

English

Publisher

ieee

Conference_Titel

Spoken Language Technology Workshop, 2006. IEEE

Conference_Location

Palm Beach

Print_ISBN

1-4244-0872-5

Type

conf

DOI

10.1109/SLT.2006.326775

Filename

4123392