• DocumentCode
    1274460
  • Title

    Development of an acoustic-phonetic hidden Markov model for continuous speech recognition

  • Author

    Ljolje, Andrej ; Levinson, Stephen E.

  • Author_Institution
    AT&T Bell Lab., Murray Hill, NJ, USA
  • Volume
    39
  • Issue
    1
  • fYear
    1991
  • fDate
    1/1/1991 12:00:00 AM
  • Firstpage
    29
  • Lastpage
    39
  • Abstract
    The techniques used to develop an acoustic-phonetic hidden Markov model, the problems associated with representing the whole acoustic-phonetic structure, the characteristics of the model, and how it performs as a phonetic decoder for recognition of fluent speech are discussed. The continuous variable duration model was trained using 450 sentences of fluent speech, each of which was spoken by a single speaker, and segmented and labeled using a fixed number of phonemes, each of which has a direct correspondence to the states of the matrix. The inherent variability of each phoneme is modeled as the observable random process of the Markov chain, while the phonotactic model of the unobservable phonetic sequence is represented by the state transition matrix of the hidden Markov model. The model assumes that the observed spectral data were generated by a Gaussian source. However, an analysis of the data shows that the spectra for the most of the phonemes are not normally distributed and that an alternative representation would be beneficial
  • Keywords
    Markov processes; speech recognition; Gaussian source; Markov chain; acoustic-phonetic structure; continuous speech recognition; continuous variable duration model; fluent speech; hidden Markov model; observable random process; phoneme; phonetic decoder; phonotactic model; state transition matrix; Cepstral analysis; Character recognition; Data analysis; Decoding; Hidden Markov models; Loudspeakers; Natural languages; Pattern recognition; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/78.80762
  • Filename
    80762