DocumentCode
1343451
Title
Linear trajectory models incorporating preprocessing parameters for speech recognition
Author
Chengalvarayan, Rathinavelu
Author_Institution
Lucent Technol., Bell Labs., Naperville, IL, USA
Volume
5
Issue
3
fYear
1998
fDate
3/1/1998 12:00:00 AM
Firstpage
66
Lastpage
68
Abstract
In this letter, we investigate the interactions of front-end feature extraction and back-end classification techniques in nonstationary state hidden Markov model (NSHMM) based speech recognition. The proposed model aims at finding an optimal linear transformation on the mel-warped discrete Fourier transform (DFT) features according to the minimum classification error (MCE) criterion. This linear transformation, along with the NSHMM parameters, are automatically trained using the gradient descent method. An error rate reduction of 8% is obtained on a standard 39-class TIMIT phone classification task in comparison with the MCE-trained NSHMM using conventional preprocessing techniques.
Keywords
discrete Fourier transforms; feature extraction; hidden Markov models; pattern classification; speech recognition; DFT features; HMM; TIMIT phone classification task; back-end classification; error rate reduction; front-end feature extraction; gradient descent method; linear trajectory models; mel-warped discrete Fourier transform; minimum classification error criterion; nonstationary state hidden Markov model; optimal linear transformation; preprocessing parameters; speech recognition; Cepstral analysis; Context modeling; Discrete Fourier transforms; Discrete cosine transforms; Error analysis; Feature extraction; Hidden Markov models; Speech recognition; Vectors;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/97.661563
Filename
661563
Link To Document