On the phonetic structure of a large hidden Markov model

Author

Pepper, David J. ; Clements, Mark A.

Author_Institution

Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA

fYear

1991

fDate

14-17 Apr 1991

Firstpage

465

Abstract

It is shown that the structure of a large ergodic hidden Markov model (HMM) can be decomposed into a set of substructures representing the English phonemes. The large HMM, which is trained using a standard forward-backward algorithm, is shown to be organized in a way that reflects the phonetic nature of speech. It is shown that the states of the HMM can be classified in terms of a set of broad phonetic classes and that the spectra associated with the states are related to each state´s use in the phonetic models. The phonetic models are shown to have internal structures reflecting the acoustic nature of the individual phonemes. The large HMMs used in this study are trained using the continuous speech multi-speaker TIMIT database employing a continuous observation density training algorithm. On a subset of the database, with 80 male speakers used for training and a separate set of 24 speakers reserved for testing, the phonetic recognition system achieved a 52% recognition rate with 14% insertions

Keywords

Markov processes; speech recognition; English phonemes; HMM; broad phonetic classes; continuous observation density training algorithm; continuous speech multi-speaker TIMIT database; internal structures; large ergodic hidden Markov model; male speakers; phoneme acoustic nature; phonetic recognition system; speech recognition; standard forward-backward algorithm; state associated spectra; Automatic speech recognition; Automatic testing; Cepstral analysis; Clustering algorithms; Databases; Hidden Markov models; Loudspeakers; Speech analysis; Speech recognition; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location

Toronto, Ont.

ISSN

1520-6149

Print_ISBN

0-7803-0003-3

Type

conf

DOI

10.1109/ICASSP.1991.150377

Filename

150377