Title :
Time-varying discriminative feature extraction effective for phonetic discrimination
Author :
Chengalvarayon, R.
Author_Institution :
Lucent Technol., AT&T Bell Labs., Naperville, IL
Abstract :
We investigate the interactions of front-end preprocessing and back-end classification techniques in hidden Markov model (HMM) based speech recognition. The proposed model aims at finding an optimal linear transformation on the mel-warped discrete Fourier transform (DFT) with the construction of dynamic feature parameters according to the minimum classification error (MCE) criterion. Experimental results show that state-dependent transformation on mel-warped DFT features is superior in performance to the mel-frequency cepstral coefficients (MFCCs). An error rate reduction of 15% is obtained on a standard 39-class TIMIT phone classification task in comparison with the conventional MCE-trained HMM using MFCCs and delta MFCCs that have not been subject to optimization during training
Keywords :
discrete Fourier transforms; feature extraction; hidden Markov models; parameter estimation; pattern classification; speech recognition; time-varying filters; 39-class TIMIT phone classification; HMM; MFCC; back-end classification; dynamic feature parameters; error rate reduction; front-end preprocessing; hidden Markov model; mel-frequency cepstral coefficients; mel-warped DFT features; mel-warped discrete Fourier transform; minimum classification error criterion; optimal linear transformation; performance; phonetic discrimination; speech recognition; state-dependent transformation; time-varying discriminative feature extraction; Data mining; Discrete Fourier transforms; Discrete cosine transforms; Error analysis; Feature extraction; Hidden Markov models; Nonlinear filters; Speech analysis; Speech processing; Speech recognition;
Conference_Titel :
Information, Communications and Signal Processing, 1997. ICICS., Proceedings of 1997 International Conference on
Print_ISBN :
0-7803-3676-3
DOI :
10.1109/ICICS.1997.652082