• DocumentCode
    312072
  • Title

    Segmentation of spoken dialogue by interjections, disfluent utterances and pauses

  • Author

    Takagi, Kazuyuki ; Itahashi, Shuichi

  • Author_Institution
    Univ. of Electro-Commun., Tokyo, Japan
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    697
  • Abstract
    The paper attempts to segment spontaneous speech of human to human spoken dialogues into a relatively large unit of speech, that is, a sub phrasal unit segmented by interjections, disfluent utterances and pauses. A spontaneous speech model incorporating prosody was developed, in which three kinds of speech segment models and the transition probabilities among them were specified. The segmentation experiments showed that 87.6% of the segment boundaries were located correctly within 50 msec, 81.2% within 30 msec, which showed 10.1 point increase in performance comparing with the initial model without prosodic information
  • Keywords
    interactive systems; probability; speech processing; disfluent utterances; human to human spoken dialogues; interjections; pauses; prosody; speech segment models; spoken dialogue segmentation; spontaneous speech model; spontaneous speech segmentation; sub phrasal unit; transition probabilities; Hidden Markov models; Labeling; Speech analysis; Speech processing; Speech recognition; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607457
  • Filename
    607457