DocumentCode :
179041
Title :
Log-linear dialog manager
Author :
Hao Tang ; Watanabe, Shigetaka ; Marks, Tim K. ; Hershey, John R.
Author_Institution :
Toyota Technol. Inst. at Chicago, Chicago, IL, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
4092
Lastpage :
4096
Abstract :
We design a log-linear probabilistic model for solving the dialog management task. In both planning and learning we optimize the same objective function: the expected reward. Rather than performing full policy optimization, we perform on-line estimation of the optimal action as a belief-propagation inference step. We employ context-free grammars to describe our variable spaces, which enables us to define rich features. To scale our approach to large variable spaces, we use particle belief propagation. Experiments show that the model is able to choose system actions that yield a high expected reward, outperforming its POMDP-like log-linear counterpart and a hand-crafted rule-based system.
Keywords :
Markov processes; belief maintenance; inference mechanisms; knowledge based systems; speech processing; POMDP; belief propagation inference; dialog management task; full policy optimization; log-linear dialog manager; log-linear probabilistic model; online estimation; partially observable Markov decision process; particle belief propagation; rule based system; Belief propagation; Grammar; Optimization; Planning; Probabilistic logic; Probability distribution; Production; Dialog Manager; Log-linear Model; POMDP;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854371
Filename :
6854371
Link To Document :
بازگشت