• DocumentCode
    1858464
  • Title

    Reinforcement learning of dialogue strategies with hierarchical abstract machines

  • Author

    Cuayahuitl, H. ; Renals, S. ; Lemon, O. ; Shimodaira, H.

  • Author_Institution
    CSTR, Univ. of Edinburgh, Edinburgh
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    182
  • Lastpage
    185
  • Abstract
    In this paper we propose partially specified dialogue strategies for dialogue strategy optimization, where part of the strategy is specified deterministically and the rest optimized with reinforcement learning (RL). To do this we apply RL with hierarchical abstract machines (HAMs). We also propose to build simulated users using HAMs, incorporating a combination of hierarchical deterministic and probabilistic behaviour. We performed experiments using a single-goal flight booking dialogue system, and compare two dialogue strategies (deterministic and optimized) using three types of simulated user (novice, experienced and expert). Our results show that HAMs are promising for both dialogue optimization and simulation, and provide evidence that indeed partially specified dialogue strategies can outperform deterministic ones (on average 4.7 fewer system turns) with faster learning than the traditional RL framework.
  • Keywords
    interactive systems; learning (artificial intelligence); optimisation; probability; speech processing; HAM; dialogue strategy optimization; hierarchical abstract machines; hierarchical deterministic behaviour; hierarchical probabilistic behaviour; reinforcement learning; single-goal flight booking dialogue system; Aerospace simulation; Automata; Automatic control; Automatic speech recognition; Collaboration; Control systems; Databases; Informatics; Learning; Optimization methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326775
  • Filename
    4123392