Title :
Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection
Author :
Tomalin, M. ; Woodland, P.C.
Author_Institution :
Dept. of Eng., Cambridge Univ.
Abstract :
This paper compares the performance of two types of prosodic feature models (PFMs) in a sentence boundary detection task. Specifically, systems are compared that use discriminatively trained Gaussian mixture models (MMI-GMMs) and CART-style decision trees (CDT-PFMs), along with task-specific language models, in a lattice-based decoding framework in order automatically to insert slash unit (SU) boundaries into automatic speech recognition (ASR) transcriptions of input audio files. It is shown that a system which uses MMI-GMMs performs as well as a system that uses conventional CDT-PFMs. In addition, it is shown that, when the CDT-PFM and MMI-GMM systems are combined by taking weighted averages of their respective probability streams, error rate improvements of up to 0.8% abs over the CDT-PFM baseline can be obtained for four different test sets
Keywords :
Gaussian processes; decision trees; decoding; probability; speech recognition; CART-style decision trees; automatic speech recognition transcriptions; discriminatively trained Gaussian mixture models; input audio files; lattice-based decoding framework; prosodic feature models; sentence boundary detection; slash unit boundaries; task-specific language models; Automatic speech recognition; Decision trees; Decoding; Ear; Error analysis; Model driven engineering; Natural languages; Streaming media; System testing; Training data;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660079