DocumentCode
1749638
Title
Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system
Author
Droppo, Jasha ; Acero, Alex ; Deng, Li
Author_Institution
Microsoft Corp., Redmond, WA, USA
Volume
1
fYear
2001
fDate
2001
Firstpage
209
Abstract
There exists a number of cepstral de-noising algorithms which perform quite well when trained and tested under similar acoustic environments, but degrade quickly under mismatched conditions. We present two key results that make these algorithms practical in real noise environments, with the ability to adapt to different acoustic environments over time. First, we show that it is possible to leverage the existing de-noising computations to estimate the acoustic environment on-line and in real time. Second, we show that it is not necessary to collect large amounts of training data in each environment-clean data with artificial mixing is sufficient. When this new method is used as a pre-processing stage to a large vocabulary speech recognition system, it can be made robust to a wide variety of acoustic environments. With synthetic training data, we are able to reduce the word error rate by 27%
Keywords
Bayes methods; acoustic noise; cepstral analysis; estimation theory; probability; speech recognition; FCDCN; cepstral de-noising algorithms; continuous speech recognition system; large vocabulary speech recognition system; online acoustic environment estimation; pre-processing; Acoustic noise; Acoustic testing; Cepstral analysis; Degradation; Noise reduction; Performance evaluation; Speech recognition; Training data; Vocabulary; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940804
Filename
940804
Link To Document