• DocumentCode
    1920102
  • Title

    On the phonetic structure of a large hidden Markov model

  • Author

    Pepper, David J. ; Clements, Mark A.

  • Author_Institution
    Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    465
  • Abstract
    It is shown that the structure of a large ergodic hidden Markov model (HMM) can be decomposed into a set of substructures representing the English phonemes. The large HMM, which is trained using a standard forward-backward algorithm, is shown to be organized in a way that reflects the phonetic nature of speech. It is shown that the states of the HMM can be classified in terms of a set of broad phonetic classes and that the spectra associated with the states are related to each state´s use in the phonetic models. The phonetic models are shown to have internal structures reflecting the acoustic nature of the individual phonemes. The large HMMs used in this study are trained using the continuous speech multi-speaker TIMIT database employing a continuous observation density training algorithm. On a subset of the database, with 80 male speakers used for training and a separate set of 24 speakers reserved for testing, the phonetic recognition system achieved a 52% recognition rate with 14% insertions
  • Keywords
    Markov processes; speech recognition; English phonemes; HMM; broad phonetic classes; continuous observation density training algorithm; continuous speech multi-speaker TIMIT database; internal structures; large ergodic hidden Markov model; male speakers; phoneme acoustic nature; phonetic recognition system; speech recognition; standard forward-backward algorithm; state associated spectra; Automatic speech recognition; Automatic testing; Cepstral analysis; Clustering algorithms; Databases; Hidden Markov models; Loudspeakers; Speech analysis; Speech recognition; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1991.150377
  • Filename
    150377