Title :
Normalizing the Speech Modulation Spectrum for Robust Speech Recognition
Author :
Xiong Xiao ; Eng Siong Chng ; Haizhou Li
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
Abstract :
This paper presents a novel feature normalization technique for robust speech recognition. The proposed technique normalizes the temporal structure of the feature to reduce the feature variation due to environmental interferences. Specifically, it normalizes the utterance-dependent feature modulation spectrum to a reference function by filtering the feature using a square-root Wiener filter in the temporal domain. We show experimentally that the proposed technique when combined with mean and variance normalization technique (MVN) reduces the word error rate significantly on the AURORA-2 task, with relative error rate reduction 69.11% compared to the baseline.
Keywords :
Wiener filters; filtering theory; modulation; speech processing; speech recognition; feature normalization technique; filtering; robust speech recognition; speech modulation spectrum; square-root Wiener filter; utterance-dependent feature modulation spectrum; variance normalization technique; Additive noise; Automatic speech recognition; Cepstral analysis; Error analysis; Frequency modulation; Histograms; Interference; Noise robustness; Speech recognition; Statistical distributions; Speech recognition; feature normalization; modulation spectrum; square-root Wiener filter; temporal filter;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367246