Title :
Recognizing disfluencies in conversational speech
Author :
Lease, Matthew ; Johnson, Mark ; Charniak, Eugene
Author_Institution :
Dept. of Comput. Sci., Brown Univ., Providence, RI
Abstract :
We present a system for modeling disfluency in conversational speech: repairs, fillers, and self-interruption points (IPs). For each sentence, candidate repair analyses are generated by a stochastic tree adjoining grammar (TAG) noisy-channel model. A probabilistic syntactic language model scores the fluency of each analysis, and a maximum-entropy model selects the most likely analysis given the language model score and other features. Fillers are detected independently via a small set of deterministic rules, and IPs are detected by combining the output of repair and filler detection modules. In the recent Rich Transcription Fall 2004 (RT-04F) blind evaluation, systems competed to detect these three forms of disfluency under two input conditions: a best-case scenario of manually transcribed words and a fully automatic case of automatic speech recognition (ASR) output. For all three tasks and on both types of input, our system was the top performer in the evaluation
Keywords :
maximum entropy methods; speech recognition; stochastic processes; trees (mathematics); Rich Transcription Fall 2004 blind evaluation; automatic speech recognition; candidate repair analyses; conversational speech; deterministic rules; disfluencies recognition; filler detection; language model score; maximum-entropy model; probabilistic syntactic language model; self-interruption points; stochastic tree adjoining grammar noisy-channel model; Automatic speech recognition; Information processing; Laboratories; Natural language processing; Noise generators; Performance evaluation; Speech analysis; Speech recognition; Stochastic processes; Technical Activities Guide -TAG; Disfluency modeling; natural language processing; rich transcription; speech processing;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.878269