• DocumentCode
    290108
  • Title

    Application of vector quantized hidden Markov modeling to telephone network based connected digit recognition

  • Author

    Buhrke, E.R. ; Cardin, Regis ; Normandin, Yves ; Rahim, Mazin ; Wilpon, Jay

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    i
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    Connected digit speech recognition in the telephone network is becoming increasingly more important as the demand for speech technology becomes widespread. In the past few years, several highly successful techniques for recognizing spoken connected digit strings have been proposed. Although these techniques have been applied to non-telephone based speech [e.g. Texas Instruments database], they have produced high recognition performance. Further, similar levels of performances have been demonstrated using discrete density and continuous density based hidden Markov models (HMMs). The success of the vector quantized (VQ) modeling approach, in particular, is encouraging and rather important from the viewpoint of computational efficiency. This paper presents a study of connected digit recognition on telephone network based data using VQ HMMs. We investigate several factors affecting the error rate of VQ HMMs-such as maximum mutual information (MMI) training, sender modeling, and codebook size-and measure their contributions to recognition accuracy. The model architecture, number of states and transitions, is also optimized and its contribution to overall performance discussed
  • Keywords
    hidden Markov models; speech coding; speech recognition; telephone networks; telephony; vector quantisation; VQ HMM; codebook size; computational efficiency; connected digit speech recognition; error rate; hidden Markov models; maximum mutual information training; model architecture; performance; recognition accuracy; recognition performance; sender modeling; speech technology; telephone network; vector quantized hidden Markov modeling; Computational efficiency; Computer architecture; Databases; Error analysis; Hidden Markov models; Instruments; Mutual information; Size measurement; Speech recognition; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389344
  • Filename
    389344