DocumentCode
394325
Title
Perceptual MVDR-based cepstral coefficients (PMCCs) for robust speech recognition
Author
Yapanel, Umit H. ; Dharanipragada, Satya
Author_Institution
Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper describes a robust feature extraction technique for continuous speech recognition. Central to the technique is the minimum variance distortionless response (MVDR) method of spectrum estimation. We incorporate perceptual information directly in to the spectrum estimation. This provides improved robustness and computational efficiency when compared with the previously proposed MVDR-MFCC technique. On an in-car speech recognition task this method, which we refer to as PMCC, is 15% more accurate in WER and requires approximately a factor of 4 times less computation than the MVDR-MFCC technique. On the same task PMCC yields 20% relative improvement over MFCC and 11% relative improvement over PLP frontends. Similar improvements are observed on the Aurora 2 database.
Keywords
cepstral analysis; error statistics; feature extraction; speech recognition; PMCC; WER; computational efficiency; continuous speech recognition; in-car speech recognition task; minimum variance distortionless response; perceptual MVDR-based cepstral coefficients; robust feature extraction; spectrum estimation; Cepstral analysis; Feature extraction; Finite impulse response filter; Frequency estimation; Natural languages; Power harmonic filters; Robustness; Spectral analysis; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198863
Filename
1198863
Link To Document