DocumentCode
352355
Title
LDA derived cepstral trajectory filters in adverse environmental conditions
Author
Lieb, Markus ; Haeb-Umbach, Reinhold
Author_Institution
Philips GmbH Forschungslab., Aachen, Germany
Volume
2
fYear
2000
fDate
2000
Abstract
Amongst several data driven approaches for designing filters for the time sequence of spectral parameters, the linear discriminant analysis (LDA) based method has been proposed for automatic speech recognition. Here we apply LDA-based filter design to cepstral features, which better match the inherent assumption of this method that feature vector components are uncorrelated. Extensive recognition experiments have been conducted both on the standard TIMIT phone recognition task and on a proprietary 130-words command word task under various adverse environmental conditions, including reverberant data with real-life room impulse responses and data processed by acoustic echo cancellation algorithms. Significant error rate reductions have been achieved when applying the novel long-range feature filters compared to standard approaches employing cepstral mean normalization and delta and delta-delta features, in particular when facing acoustic echo cancellation scenarios and room reverberation. For example, the phone accuracy on reverberated TIMIT data could be increased from 50.7% to 56.0%
Keywords
FIR filters; cepstral analysis; echo suppression; speech recognition; LDA derived cepstral trajectory filters; acoustic echo cancellation algorithms; adverse environmental conditions; automatic speech recognition; cepstral features; cepstral mean normalization; command word task; delta features; delta-delta features; error rate reductions; feature vector components; linear discriminant analysis; long-range feature filters; phone accuracy; real-life room impulse responses; reverberant data; spectral parameters; standard TIMIT phone recognition task; Automatic speech recognition; Cepstral analysis; Decoding; Finite impulse response filter; Frequency; Linear discriminant analysis; Nonlinear filters; Spatial databases; Speech recognition; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.859157
Filename
859157
Link To Document