Title :
Speech recognition using regularized minimum variance distortionless response spectrum estimation-based cepstral features
Author :
Alam, Mohammad Jahangir ; Kenny, P. ; O´Shaughnessy, D.
Author_Institution :
INRS-EMT, Univ. of Quebec, Montreal, QC, Canada
Abstract :
This paper presents regularized minimum variance distortion-less response (MVDR)-based cepstral features for robust continuous speech recognition. The mel-frequency cepstral coefficient (MFCC) features, widely used in speech recognition tasks, are usually computed from a direct spectrum estimate, that is, the squared magnitude of the discrete Fourier transform (DFT) of speech frames. Direct spectrum estimation methods (also known as nonparametric estimators) perform poorly under noisy and adverse conditions. To reduce this performance drop we propose to increase robustness of the speech recognition system by extracting more robust features based on the regularized MVDR technique. The proposed method, when evaluated on the AURORA-4 speech recognition task, provides an average relative improvement in word accuracy of 11.3%, 6.1%, and 5.2% over the conventional MFCC, PLP, MVDR and PMVDR-based MFCC features, respectively.
Keywords :
cepstral analysis; discrete Fourier transforms; speech recognition; AURORA-4 speech recognition task; adverse condition; direct spectrum estimate; direct spectrum estimation methods; discrete Fourier transform; mel-frequency cepstral coefficient features; noisy condition; nonparametric estimators; regularized MVDR technique; regularized minimum variance distortionless response spectrum estimation-based cepstral features; robust continuous speech recognition; speech frames; speech recognition system; speech recognition tasks; word accuracy; Accuracy; Feature extraction; Mel frequency cepstral coefficient; Robustness; Spectral analysis; Speech; Speech recognition; Speech recognition; linear prediction; regularized MVDR; spectrum estimation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639237