Title :
Application of vector quantized hidden Markov modeling to telephone network based connected digit recognition
Author :
Buhrke, E.R. ; Cardin, Regis ; Normandin, Yves ; Rahim, Mazin ; Wilpon, Jay
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
Connected digit speech recognition in the telephone network is becoming increasingly more important as the demand for speech technology becomes widespread. In the past few years, several highly successful techniques for recognizing spoken connected digit strings have been proposed. Although these techniques have been applied to non-telephone based speech [e.g. Texas Instruments database], they have produced high recognition performance. Further, similar levels of performances have been demonstrated using discrete density and continuous density based hidden Markov models (HMMs). The success of the vector quantized (VQ) modeling approach, in particular, is encouraging and rather important from the viewpoint of computational efficiency. This paper presents a study of connected digit recognition on telephone network based data using VQ HMMs. We investigate several factors affecting the error rate of VQ HMMs-such as maximum mutual information (MMI) training, sender modeling, and codebook size-and measure their contributions to recognition accuracy. The model architecture, number of states and transitions, is also optimized and its contribution to overall performance discussed
Keywords :
hidden Markov models; speech coding; speech recognition; telephone networks; telephony; vector quantisation; VQ HMM; codebook size; computational efficiency; connected digit speech recognition; error rate; hidden Markov models; maximum mutual information training; model architecture; performance; recognition accuracy; recognition performance; sender modeling; speech technology; telephone network; vector quantized hidden Markov modeling; Computational efficiency; Computer architecture; Databases; Error analysis; Hidden Markov models; Instruments; Mutual information; Size measurement; Speech recognition; Telephony;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389344