DocumentCode
3485986
Title
Decision of response timing for incremental speech recognition with reinforcement learning
Author
Lu, Di ; Nishimoto, Takuya ; Minematsu, Nobuaki
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2011
fDate
11-15 Dec. 2011
Firstpage
467
Lastpage
472
Abstract
In spoken dialog systems, it is important to reduce the delay in generating a response to a user's utterance. We investigate the use of incremental recognition results, which can be obtained from a speech recognition engine before the input utterance ends. To enable the system to respond correctly before the end of the utterance, it is desirable to use the incremental results effectively, even though they are not fully reliable. We formulate this problem as a decision-making task in which the system iteratively chooses either to answer based on previous observations or to wait for the next observation. Reinforcement learning can be applied to this problem. In experiments, users rated highly the proposed method, which estimates the completion time of a user's utterance using speech recognition results based on mora units.
Keywords
decision making; learning (artificial intelligence); speech recognition; completion time estimation; decision making task; incremental speech recognition; mora units; reinforcement learning; response timing; speech recognition engine; spoken dialog systems; user utterance; Delay; Error analysis; Learning; Speech; Speech recognition; Vocabulary;
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location
Waikoloa, HI
Print_ISBN
978-1-4673-0365-1
Electronic_ISBN
978-1-4673-0366-8
Type
conf
DOI
10.1109/ASRU.2011.6163976
Filename
6163976