• DocumentCode
    2253972
  • Title

    Modeling disfluencies in conversational speech

  • Author

    Siu, Man-Hung ; Ostendorf, Mari

  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    386
  • Abstract
    Conversational speech is notably different from read speech in several ways, particularly in the presence of disfluencies but also in the frequent use of a small set of words that mark the flow of the discourse. Disfluencies are sometimes viewed as a “problem” in language modeling, where most previous work has focused on written text. In this paper, we take the view that disfluencies provide information themselves. In particular, we give evidence that filled pauses serve different functions, including marking linguistic unit and restart boundaries, and signaling hesitation where the speaker wants to hold the floor. The different functions can be connected to similar functions of other words common in spontaneous but not written speech, and the particular function affects the word conditioning choices in a variable n-gram model. Thus, at least some of the idiosyncrasies of spontaneous speech can be viewed as a source of information for language modeling rather than an interruption in the linguistic structure
  • Keywords
    linguistics; nomograms; conversational speech; discourse flow; disfluencies; filled pauses; hesitation signals; idiosyncrasies; information content; interruptions; language modeling; linguistic structure; linguistic unit boundary marking; restart boundary marking; spontaneous speech; variable n-gram model; word conditioning choices; words; History; Information resources; Natural languages; Speech recognition; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607135
  • Filename
    607135