DocumentCode
290108
Title
Application of vector quantized hidden Markov modeling to telephone network based connected digit recognition
Author
Buhrke, E.R. ; Cardin, Regis ; Normandin, Yves ; Rahim, Mazin ; Wilpon, Jay
Author_Institution
AT&T Bell Labs., Murray Hill, NJ, USA
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
Connected digit speech recognition in the telephone network is becoming increasingly more important as the demand for speech technology becomes widespread. In the past few years, several highly successful techniques for recognizing spoken connected digit strings have been proposed. Although these techniques have been applied to non-telephone based speech [e.g. Texas Instruments database], they have produced high recognition performance. Further, similar levels of performances have been demonstrated using discrete density and continuous density based hidden Markov models (HMMs). The success of the vector quantized (VQ) modeling approach, in particular, is encouraging and rather important from the viewpoint of computational efficiency. This paper presents a study of connected digit recognition on telephone network based data using VQ HMMs. We investigate several factors affecting the error rate of VQ HMMs-such as maximum mutual information (MMI) training, sender modeling, and codebook size-and measure their contributions to recognition accuracy. The model architecture, number of states and transitions, is also optimized and its contribution to overall performance discussed
Keywords
hidden Markov models; speech coding; speech recognition; telephone networks; telephony; vector quantisation; VQ HMM; codebook size; computational efficiency; connected digit speech recognition; error rate; hidden Markov models; maximum mutual information training; model architecture; performance; recognition accuracy; recognition performance; sender modeling; speech technology; telephone network; vector quantized hidden Markov modeling; Computational efficiency; Computer architecture; Databases; Error analysis; Hidden Markov models; Instruments; Mutual information; Size measurement; Speech recognition; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389344
Filename
389344
Link To Document