مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-domain dialogue success classifiers for policy training

DocumentCode :

3744915

Title :

Multi-domain dialogue success classifiers for policy training

Author :

David Vandyke;Pei-Hao Su;Milica Gasic;Nikola Mrksic;Tsung-Hsien Wen;Steve Young

Author_Institution :

Department of Engineering, University of Cambridge, Cambridge, UK

fYear :

2015

Firstpage :

763

Lastpage :

770

Abstract :

We propose a method for constructing dialogue success classifiers that are capable of making accurate predictions in domains unseen during training. Pooling and adaptation are also investigated for constructing multi-domain models when data is available in the new domain. This is achieved by reformulating the features input to the recurrent neural network models introduced in [1]. Importantly, on our task of main interest, this enables policy training in a new domain without the dialogue success classifier (which forms the reinforcement learning reward function) ever having seen data from that domain before. This occurs whilst incurring only a small reduction in performance relative to developing and using an in-domain dialogue success classifier. Finally, given the motivation with these dialogue success classifiers is to enable policy training with real users, we demonstrate that these initial policy training results obtained with a simulated user carry over to learning from paid human users.

Keywords :

"Training","Adaptation models","Predictive models","Data models","Semantics","Standards","Entropy"

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on

Type :

conf

DOI :

10.1109/ASRU.2015.7404865

Filename :

7404865

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744915