DocumentCode :
1920102
Title :
On the phonetic structure of a large hidden Markov model
Author :
Pepper, David J. ; Clements, Mark A.
Author_Institution :
Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
465
Abstract :
It is shown that the structure of a large ergodic hidden Markov model (HMM) can be decomposed into a set of substructures representing the English phonemes. The large HMM, which is trained using a standard forward-backward algorithm, is shown to be organized in a way that reflects the phonetic nature of speech. It is shown that the states of the HMM can be classified in terms of a set of broad phonetic classes and that the spectra associated with the states are related to each state´s use in the phonetic models. The phonetic models are shown to have internal structures reflecting the acoustic nature of the individual phonemes. The large HMMs used in this study are trained using the continuous speech multi-speaker TIMIT database employing a continuous observation density training algorithm. On a subset of the database, with 80 male speakers used for training and a separate set of 24 speakers reserved for testing, the phonetic recognition system achieved a 52% recognition rate with 14% insertions
Keywords :
Markov processes; speech recognition; English phonemes; HMM; broad phonetic classes; continuous observation density training algorithm; continuous speech multi-speaker TIMIT database; internal structures; large ergodic hidden Markov model; male speakers; phoneme acoustic nature; phonetic recognition system; speech recognition; standard forward-backward algorithm; state associated spectra; Automatic speech recognition; Automatic testing; Cepstral analysis; Clustering algorithms; Databases; Hidden Markov models; Loudspeakers; Speech analysis; Speech recognition; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150377
Filename :
150377
Link To Document :
بازگشت