• DocumentCode
    75420
  • Title

    Unsegmented Dialogue Act Annotation and Decoding With N-Gram Transducers

  • Author

    Martinez-Hinarejos, Carlos-D ; Benedi, Jose-Miguel ; Tamarit, Vicent

  • Author_Institution
    Pattern Recognition & Human Language Technol. Center, Univ. Politec. de Valencia, Valencia, Spain
  • Volume
    23
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 2015
  • Firstpage
    198
  • Lastpage
    211
  • Abstract
    Most studies on dialogue corpora, as well as most dialogue systems, employ dialogue acts as the basic units for interpreting discourse structure, user input and system actions. The definition of the discourse structure and the dialogue strategy consequently require the tagging of dialogue corpora in terms of dialogue acts. The tagging problem presents two basic variants: a batch variant (annotation of whole dialogues, in order to define dialogue strategy or study discourse structure) and an online variant (decoding of the dialogue act sequence of a given turn, in order to interpret user intentions). In the two variants is unusual having the segmentation of each turn into the dialogue meaningful units (segments) to which a dialogue act is assigned. In this paper we present the use of the N-Gram Transducer technique for tagging dialogues, without needing to provide a prior segmentation, in these two different variants (dialogue annotation and turn decoding). Experiments were performed in two corpora of different nature and results show that N-Gram Transducer models are suitable for these tasks and provide good performance.
  • Keywords
    interactive systems; natural language processing; batch variant; dialogue corpora; dialogue tagging; discourse structure; n-gram transducer technique; online variant; system actions; unsegmented dialogue act annotation; unsegmented dialogue act decoding; user input; Decoding; Hidden Markov models; Speech; Speech processing; Speech recognition; Tagging; Transducers; Dialogue annotation; n-gram transducer; spoken dialogue systems;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2377595
  • Filename
    6975044