Title :
Noise robust speech recognition by combining speech enhancement in the wavelet domain and Lin-log RASTA
Author :
Jie, Yang ; Zhenli, Wang
Author_Institution :
Sch. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
Abstract :
For improving noise robustness of speech recognition under adverse noise environment, a method of noise robust speech recognition, which combines discrete wavelet transform (DWT), wavelet packet decomposition (WPD) and Lin-log RASTA, is researched in this paper. After one scale of DWT was employed for noisy speech, this method used three scales of DWT and three scales of WPD for the low frequency signal and the high frequency signal, respectively. Multithresholds processing and decision of unvoiced sounds and voiced sounds were also adopted in order to improve the performance of denoising. The Lin-log RASTA coefficients were then computed from the enhanced speech as feature vectors. Cepstral mean subtraction (CMS) was used for compensating the speech distortion and residual noise of the above processing. Experimental results indicate that this method performs better for digital speech recognition than Lin-log RASTA, Spectral Subtraction+ Lin-log RASTA, mel-frequency cepstral coefficients (MFCC) and RASTA.
Keywords :
cepstral analysis; discrete wavelet transforms; speech enhancement; speech recognition; Lin-log RASTA; cepstral mean subtraction; discrete wavelet transform; feature vector; multithreshold processing; noise robust speech recognition; residual noise; speech distortion; speech enhancement; wavelet packet decomposition; Acoustic noise; Discrete wavelet transforms; Frequency; Low-frequency noise; Noise robustness; Speech enhancement; Speech recognition; Wavelet domain; Wavelet packets; Working environment noise; Cepstral Mean Subtraction (CMS); Spectral Subtraction (SS); noise robust speech recognition; speech enhancement;
Conference_Titel :
Computing, Communication, Control, and Management, 2009. CCCM 2009. ISECS International Colloquium on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-4247-8
DOI :
10.1109/CCCM.2009.5267457