DocumentCode :
284652
Title :
Spectral contrast normalization and other techniques for speech recognition in noise
Author :
Bateman, D.C. ; Bye, D.K. ; Hunt, M.J.
Author_Institution :
Marconi Speech & Information Systems, Portsmouth, UK
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
241
Abstract :
Three methods of improving speech recognition in noise are considered: energy thresholding, a noise-robust spectral representation called IMELDA, and a set of noise-robust spectral distortion measures. The spectral distortion measures can be seen as normalizing the contrast in the spectrum, a problem which can be transferred to the representation itself, making it computationally more efficient. In speaker-independent alphabet recognition tests in added steady white noise at various levels, IMELDA is shown to outperform a weighted cepstrum representation and be computationally more efficient. With this material and with digits recorded in trucks at a wide range of noise levels, performance is found to depend strongly on the threshold level. Contrast normalization is found to help, but only when the energy threshold is far from its optimum level
Keywords :
spectral analysis; speech recognition; white noise; IMELDA; energy thresholding; noise-robust spectral distortion measures; noise-robust spectral representation; performance; speaker-independent alphabet recognition tests; spectral contrast normalization; speech recognition; white noise; Acoustic distortion; Acoustic measurements; Acoustic noise; Distortion measurement; Noise level; Noise measurement; Noise robustness; Speech enhancement; Speech recognition; White noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225927
Filename :
225927
Link To Document :
بازگشت