Decision of response timing for incremental speech recognition with reinforcement learning

Author

Lu, Di ; Nishimoto, Takuya ; Minematsu, Nobuaki

Author_Institution

Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan

fYear

2011

fDate

11-15 Dec. 2011

Firstpage

467

Lastpage

472

Abstract

In spoken dialog systems, it is important to reduce the delay in generating a response to a user´s utterance. We investigate the use of incremental recognition results which can be obtained from a speech recognition engine before the input utterance ends. To enable the system to respond correctly before the end of the utterance, it is desired to utilize the incremental results effectively, although they are not reliable enough. We formulate this problem as a decision making task, in which the system makes choices iteratively either to answer based on previous observations, or to wait until the next observation. The reinforcement learning can be applied to the problem. As the results of experiments, the users highly evaluate the proposed method which estimate completion time of a user´s utterance by using the results of speech recognition based on mora units.

Keywords

decision making; learning (artificial intelligence); speech recognition; completion time estimation; decision making task; incremental speech recognition; mora units; reinforcement learning; response timing; speech recognition engine; spoken dialog systems; user utterance; Delay; Error analysis; Learning; Speech; Speech recognition; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on

Conference_Location

Waikoloa, HI

Print_ISBN

978-1-4673-0365-1

Electronic_ISBN

978-1-4673-0366-8

Type

conf

DOI

10.1109/ASRU.2011.6163976

Filename

6163976