  • DocumentCode
    3485986
  • Title
    Decision of response timing for incremental speech recognition with reinforcement learning
  • Author
    Lu, Di ; Nishimoto, Takuya ; Minematsu, Nobuaki
  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    467
  • Lastpage
    472
  • Abstract
    In spoken dialog systems, it is important to reduce the delay in generating a response to a user's utterance. We investigate the use of incremental recognition results, which can be obtained from a speech recognition engine before the input utterance ends. To enable the system to respond correctly before the end of the utterance, the incremental results should be used effectively even though they are not fully reliable. We formulate this problem as a decision-making task in which the system iteratively chooses either to answer based on previous observations or to wait for the next observation. Reinforcement learning can be applied to this problem. In experiments, users rated highly the proposed method, which estimates the completion time of a user's utterance from speech recognition results based on mora units.
  • Keywords
    decision making; learning (artificial intelligence); speech recognition; completion time estimation; decision making task; incremental speech recognition; mora units; reinforcement learning; response timing; speech recognition engine; spoken dialog systems; user utterance; Delay; Error analysis; Learning; Speech; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type
    conf
  • DOI
    10.1109/ASRU.2011.6163976
  • Filename
    6163976