DocumentCode :
2839111
Title :
Energy contour enhancement for noisy speech recognition
Author :
Hwang, Tai-Hwei ; Chang, Sen-Chia
Author_Institution :
E000/Comput. & Commun. Labs, Ind. Technol. Res. Inst., Hsinchu, Taiwan
fYear :
2004
fDate :
15-18 Dec. 2004
Firstpage :
249
Lastpage :
252
Abstract :
Environmental noise, known as an additive noise, not only corrupts the spectra of a speech signal but also blurs the shape of its energy contour. The corruption of the energy contour can distort the energy derived feature and degrade the pattern classification performance of noisy speech. To reduce the distortion of the energy feature, the energy bias in the energy contour has to be removed before the feature extraction. For this purpose, we propose two methods to estimate the noise energy; one is obtained from the speech inactive period, and one is from the noisy speech itself. The methods are evaluated by the connected digit recognition of TIDigits, in which the test speech is corrupted with white noise, babble, factory noise, and in-car noises. As shown in the experiments, the energy enhancement can provide an additional improvement when it is jointly applied with a spectral subtraction.
Keywords :
acoustic noise; feature extraction; pattern classification; speech enhancement; speech recognition; white noise; ASR; TIDigits database; automatic speech recognition; babble; connected digit recognition; energy contour bias removal; energy contour enhancement; energy contour shape blurring; energy derived features; energy feature distortion; environmental additive noise; factory noise; feature extraction; in-car noises; noise energy estimation; noisy speech pattern classification; noisy speech recognition; spectral subtraction; speech inactive period; speech signal spectra corruption; white noise; Additive noise; Degradation; Feature extraction; Noise shaping; Pattern classification; Shape; Speech enhancement; Speech recognition; Testing; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
Type :
conf
DOI :
10.1109/CHINSL.2004.1409633
Filename :
1409633
Link To Document :
بازگشت