• DocumentCode
    103846
  • Title

    Unsupervised Spoken Language Understanding for a Multi-Domain Dialog System

  • Author

    Donghyeon Lee ; Minwoo Jeong ; Kyungduk Kim ; Seonghan Ryu ; Lee, Gwo Giun

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pohang Univ. of Sci. & Technol. (POSTECH), Pohang, South Korea
  • Volume
    21
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    2451
  • Lastpage
    2464
  • Abstract
    This paper proposes an unsupervised spoken language understanding (SLU) framework for a multi-domain dialog system. Our unsupervised SLU framework applies a non-parametric Bayesian approach to dialog acts, intents and slot entities, which are the components of a semantic frame. The proposed approach reduces the human effort necessary to obtain a semantically annotated corpus for dialog system development. In this study, we analyze clustering results using various evaluation metrics for four dialog corpora. We also introduce a multi-domain dialog system that uses the unsupervised SLU framework. We argue that our unsupervised approach can help overcome the annotation acquisition bottleneck in developing dialog systems. To verify this claim, we report a dialog system evaluation, in which our method achieves competitive results in comparison with a system that uses a manually annotated corpus. In addition, we conducted several experiments to explore the effect of our approach on reducing development costs. The results show that our approach be helpful for the rapid development of a prototype system and reducing the overall development costs.
  • Keywords
    human computer interaction; interactive systems; natural language processing; unsupervised learning; annotation acquisition bottleneck; clustering result; dialog acts; dialog corpora; dialog system development; dialog system evaluation; evaluation metrics; multidomain dialog system; nonparametric Bayesian approach; overall development cost reduction; semantic frame component; slot entities; unsupervised SLU framework; unsupervised spoken language understanding framework; Bayes methods; Grammar; Guidelines; Hidden Markov models; Labeling; Semantics; Training data; Dialog system; spoken language understanding; unsupervised learning;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2280212
  • Filename
    6587768