DocumentCode :
352355
Title :
LDA derived cepstral trajectory filters in adverse environmental conditions
Author :
Lieb, Markus ; Haeb-Umbach, Reinhold
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Volume :
2
fYear :
2000
fDate :
2000
Abstract :
Amongst several data-driven approaches for designing filters for the time sequence of spectral parameters, the linear discriminant analysis (LDA) based method has been proposed for automatic speech recognition. Here we apply LDA-based filter design to cepstral features, which better match the inherent assumption of this method that feature vector components are uncorrelated. Extensive recognition experiments have been conducted both on the standard TIMIT phone recognition task and on a proprietary 130-word command word task under various adverse environmental conditions, including reverberant data with real-life room impulse responses and data processed by acoustic echo cancellation algorithms. Significant error rate reductions have been achieved when applying the novel long-range feature filters compared to standard approaches employing cepstral mean normalization and delta and delta-delta features, in particular when facing acoustic echo cancellation scenarios and room reverberation. For example, the phone accuracy on reverberated TIMIT data could be increased from 50.7% to 56.0%.
Keywords :
FIR filters; cepstral analysis; echo suppression; speech recognition; LDA derived cepstral trajectory filters; acoustic echo cancellation algorithms; adverse environmental conditions; automatic speech recognition; cepstral features; cepstral mean normalization; command word task; delta features; delta-delta features; error rate reductions; feature vector components; linear discriminant analysis; long-range feature filters; phone accuracy; real-life room impulse responses; reverberant data; spectral parameters; standard TIMIT phone recognition task; Automatic speech recognition; Cepstral analysis; Decoding; Finite impulse response filter; Frequency; Linear discriminant analysis; Nonlinear filters; Spatial databases; Speech recognition; Vectors;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.859157
Filename :
859157