• DocumentCode
    937244
  • Title

    Multonic Markov word models for large vocabulary continuous speech recognition

  • Author

    Bahl, Lalit R. ; Bellegarda, Jerome R. ; De Souza, Peter V. ; Gopalakrishnan, P.S. ; Nahamoo, David ; Picheny, Michael A.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • Issue
    3
  • fYear
    1993
  • fDate
    7/1/1993 12:00:00 AM
  • Firstpage
    334
  • Lastpage
    344
  • Abstract
    A new class of hidden Markov models is proposed for the acoustic representation of words in an automatic speech recognition system. The models, built from combinations of acoustically based sub-word units called fenones, are derived automatically from one or more sample utterances of a word. Because they are more flexible than previously reported fenone-based word models, they lead to an improved capability of modeling variations in pronunciation. They are therefore particularly useful in the recognition of continuous speech. In addition, their construction is relatively simple, because it can be done using the well-known forward-backward algorithm for parameter estimation of hidden Markov models. Appropriate reestimation formulas are derived for this purpose. Experimental results obtained on a 5000-word vocabulary natural language continuous speech recognition task are presented to illustrate the enhanced power of discrimination of the new models
  • Keywords
    hidden Markov models; parameter estimation; speech recognition; HMM; acoustic representation of words; continuous speech; fenones; forward-backward algorithm; hidden Markov models; large vocabulary; multonic Markov word models; parameter estimation; pronunciation; reestimation formulas; speech recognition; Automatic speech recognition; Decoding; Equations; Hidden Markov models; Loudspeakers; Natural languages; Parameter estimation; Power system modeling; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.232617
  • Filename
    232617