DocumentCode
381269
Title
Robust speech recognition with multi-channel codebook dependent cepstral normalization (MCDCN)
Author
Deligne, Sabine ; Gopinath, Ramesh
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2001
fDate
2001
Firstpage
151
Lastpage
154
Abstract
We address the issue of speech recognition in the presence of interfering signals, in cases where the signals corrupting the speech are recorded in separate channels. We propose to combine a trivial form of filtering with MCDCN, a multi-channel version of codebook dependent cepstral normalization, where the cepstra of the noise are estimated from the reference signals. We report on recognition experiments in a car where the speech signal is corrupted by radio talks or CD music played by the car speakers. Our approach allows relative word error rate reductions in the range of 70-90% compared to a no-compensation baseline, at a relatively low computational cost.
Keywords
acoustic noise; cepstral analysis; error statistics; interference (signal); parameter estimation; speech recognition; cepstra estimation; interfering signals; multi-channel codebook dependent cepstral normalization; robust speech recognition; word error rate; Acoustic noise; Adaptive filters; Cepstral analysis; Decorrelation; Filtering; Linear systems; Nonlinear filters; Robustness; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034610
Filename
1034610
Link To Document