Title :
Statistical segmentation and word modeling techniques in isolated word recognition
Author :
Euler, S. ; Juang, B. ; Lee, G. ; Soong, F.
Author_Institution :
AT&T Bell Lab., Murray Hill, NJ, USA
Abstract :
A speech recognition system is described using a combination of statistical segment and word modeling. Segment models are constructed by first segmenting training data automatically and then grouping the resultant segments into clusters. Mixtures of Gaussian densities are used to model each segment cluster. In order to integrate the segment models into word models, a generalization of the hidden Markov model approach is proposed. Experimental results on a multispeaker recognition system for alpha-digits demonstrate that the new approach improved the performance of conventional whole-word-based models. In particular, the word models show good discrimination abilities for differentiating phonetically similar words such as the E-set alphabet
Keywords :
Markov processes; speech recognition; E-set alphabet; Gaussian densities; acoustic segmentation; hidden Markov model; multispeaker recognition system; segment clustering; speech recognition system; statistical segment; word modeling; Acoustic distortion; Density functional theory; Dynamic programming; Hidden Markov models; Signal analysis; Speech analysis; Speech recognition; Training data; Vocabulary; Yttrium;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115898