DocumentCode
3153409
Title
Modulation spectrum exponential weighting for robust speech recognition
Author
Hao-teng Fan ; Yi-cheng Lian ; Jeih-weih Hung
Author_Institution
Dept. of Electr. Eng., Nat. Chi Nan Univ., Nantou, Taiwan
fYear
2012
fDate
5-8 Nov. 2012
Firstpage
812
Lastpage
816
Abstract
In this paper, we present a novel scheme to improve the noise robustness of features in speech recognition for vehicle noise environments. In the algorithm termed modulation spectrum exponential weighting (MSEW), the magnitude spectra of feature streams are updated by integrating a reference magnitude spectrum and the original magnitude spectrum with varying exponential weights based on the signal-to-noise ratio (SNR) of the operating environment. Specifically, we present three modes of MSEW, which can be viewed as a generalization of the two algorithms, modulation spectrum replacement/filtering (MSR/MSF). In experiments conducted on the AURORA-2 noisy digit database, the presented MSEW algorithms can achieve better recognition accuracy rates relative to the original MSR and MSF in various vehicle-noise environments.
Keywords
filtering theory; speech recognition; AURORA-2 noisy digit database; MSEW algorithms; MSR-MSF; SNR; feature stream magnitude spectra; modulation spectrum exponential weighting; modulation spectrum replacement-filtering; reference magnitude spectrum; robust speech recognition; signal-to-noise ratio; vehicle noise environments; vehicle-noise environments; Accuracy; Modulation; Robustness; Signal to noise ratio; Speech; Speech recognition; modulation spectrum; noise-robust feature; speech recognition; vehicle-noise environment;
fLanguage
English
Publisher
ieee
Conference_Titel
ITS Telecommunications (ITST), 2012 12th International Conference on
Conference_Location
Taipei
Print_ISBN
978-1-4673-3071-8
Electronic_ISBN
978-1-4673-3069-5
Type
conf
DOI
10.1109/ITST.2012.6425295
Filename
6425295
Link To Document