Title :
Linear trajectory models incorporating preprocessing parameters for speech recognition
Author :
Chengalvarayan, Rathinavelu
Author_Institution :
Lucent Technol., Bell Labs., Naperville, IL, USA
fDate :
3/1/1998 12:00:00 AM
Abstract :
In this letter, we investigate the interactions of front-end feature extraction and back-end classification techniques in nonstationary state hidden Markov model (NSHMM) based speech recognition. The proposed model aims at finding an optimal linear transformation on the mel-warped discrete Fourier transform (DFT) features according to the minimum classification error (MCE) criterion. This linear transformation, along with the NSHMM parameters, are automatically trained using the gradient descent method. An error rate reduction of 8% is obtained on a standard 39-class TIMIT phone classification task in comparison with the MCE-trained NSHMM using conventional preprocessing techniques.
Keywords :
discrete Fourier transforms; feature extraction; hidden Markov models; pattern classification; speech recognition; DFT features; HMM; TIMIT phone classification task; back-end classification; error rate reduction; front-end feature extraction; gradient descent method; linear trajectory models; mel-warped discrete Fourier transform; minimum classification error criterion; nonstationary state hidden Markov model; optimal linear transformation; preprocessing parameters; speech recognition; Cepstral analysis; Context modeling; Discrete Fourier transforms; Discrete cosine transforms; Error analysis; Feature extraction; Hidden Markov models; Speech recognition; Vectors;
Journal_Title :
Signal Processing Letters, IEEE