Title :
LDA derived cepstral trajectory filters in adverse environmental conditions
Author :
Lieb, Markus ; Haeb-Umbach, Reinhold
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Abstract :
Amongst several data driven approaches for designing filters for the time sequence of spectral parameters, the linear discriminant analysis (LDA) based method has been proposed for automatic speech recognition. Here we apply LDA-based filter design to cepstral features, which better match the inherent assumption of this method that feature vector components are uncorrelated. Extensive recognition experiments have been conducted both on the standard TIMIT phone recognition task and on a proprietary 130-words command word task under various adverse environmental conditions, including reverberant data with real-life room impulse responses and data processed by acoustic echo cancellation algorithms. Significant error rate reductions have been achieved when applying the novel long-range feature filters compared to standard approaches employing cepstral mean normalization and delta and delta-delta features, in particular when facing acoustic echo cancellation scenarios and room reverberation. For example, the phone accuracy on reverberated TIMIT data could be increased from 50.7% to 56.0%
Keywords :
FIR filters; cepstral analysis; echo suppression; speech recognition; LDA derived cepstral trajectory filters; acoustic echo cancellation algorithms; adverse environmental conditions; automatic speech recognition; cepstral features; cepstral mean normalization; command word task; delta features; delta-delta features; error rate reductions; feature vector components; linear discriminant analysis; long-range feature filters; phone accuracy; real-life room impulse responses; reverberant data; spectral parameters; standard TIMIT phone recognition task; Automatic speech recognition; Cepstral analysis; Decoding; Finite impulse response filter; Frequency; Linear discriminant analysis; Nonlinear filters; Spatial databases; Speech recognition; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.859157