Title :
Use of generalized dynamic feature parameters for speech recognition: maximum likelihood and minimum classification error approaches
Author :
Rathinavelu, C. ; Deng, L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Abstract :
In this study we implemented a speech recognizer based on the integrated view, proposed first by Deng (see IEEE Signal Processing Letters, vol.1, no.4, p.66-69, 1994), on the speech preprocessing and speech modeling problems in the recognizer design. The integrated model we developed generalizes the conventional, currently widely used delta-parameter technique, which has been confined strictly to the preprocessing domain only, in two significant ways. First, the new model contains state-dependent weighting functions responsible for transforming static speech features into the dynamic ones in a slowly time-varying manner. Second, novel maximum-likelihood and minimum-classification-error based learning algorithms are developed for the model that allows joint optimization of the state-dependent weighting functions and the remaining conventional HMM parameters. The experimental results obtained from a standard TIMIT phonetic classification task provide preliminary evidence for the effectiveness of our new, general approaches to the use of the dynamic characteristics of speech spectra
Keywords :
error analysis; hidden Markov models; maximum likelihood estimation; spectral analysis; speech processing; speech recognition; HMM parameters; TIMIT phonetic classification task; delta-parameter technique; dynamic characteristics; dynamic speech features; experimental results; generalized dynamic feature parameters; integrated model; learning algorithms; maximum likelihood classification error; minimum classification error; recognizer design; speech modeling; speech preprocessing; speech recognition; speech recognizer; speech spectra; state-dependent weighting functions; static speech features; Computer errors; Design optimization; Hidden Markov models; Speech enhancement; Speech processing; Speech recognition; Vectors; Yttrium;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479599