DocumentCode
2253972
Title
Modeling disfluencies in conversational speech
Author
Siu, Man-Hung ; Ostendorf, Mari
Volume
1
fYear
1996
fDate
3-6 Oct 1996
Firstpage
386
Abstract
Conversational speech is notably different from read speech in several ways, particularly in the presence of disfluencies but also in the frequent use of a small set of words that mark the flow of the discourse. Disfluencies are sometimes viewed as a “problem” in language modeling, where most previous work has focused on written text. In this paper, we take the view that disfluencies provide information themselves. In particular, we give evidence that filled pauses serve different functions, including marking linguistic unit and restart boundaries, and signaling hesitation where the speaker wants to hold the floor. The different functions can be connected to similar functions of other words common in spontaneous but not written speech, and the particular function affects the word conditioning choices in a variable n-gram model. Thus, at least some of the idiosyncrasies of spontaneous speech can be viewed as a source of information for language modeling rather than an interruption in the linguistic structure
Keywords
linguistics; nomograms; conversational speech; discourse flow; disfluencies; filled pauses; hesitation signals; idiosyncrasies; information content; interruptions; language modeling; linguistic structure; linguistic unit boundary marking; restart boundary marking; spontaneous speech; variable n-gram model; word conditioning choices; words; History; Information resources; Natural languages; Speech recognition; Telephony; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607135
Filename
607135
Link To Document