DocumentCode :
739168
Title :
Robust speech recognition by using spectral subtraction with noise peak shifting
Author :
Peng Dai ; Ing Yann Soon
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
7
Issue :
8
fYear :
2013
fDate :
10/1/2013
Firstpage :
684
Lastpage :
692
Abstract :
In this study, a novel technique that recovers the temporal structure of the speech power spectrum is proposed. The histogram of the average speech log power spectrum shows that noise contamination shifts the noise peak, which in turn degrades the performance of speech recognition systems. A two-step scheme is proposed to weaken the noise effects by first reducing the noise variance and then shifting the noise mean. The proposed algorithm consists of two parts, two-dimensional smoothing and controlled noise subtraction, which gives it the name SNS. The proposed algorithm solves the speech probability distribution function discontinuity problem caused by the traditional spectral subtraction family of algorithms. In contrast to clean speech estimation methods, the proposed algorithm does not need a prior speech/noise statistical model, which makes it simple but effective. The effectiveness of the proposed filter is tested using the AURORA2 database. Very promising results are obtained: 88.59% for noisy speech (averaged over signal-to-noise ratios of 0-20 dB). Comparison is made against eight state-of-the-art speech recognition algorithms. Overall, the proposed algorithm produces significant improvements over the comparison targets.
Keywords :
interference suppression; probability; smoothing methods; spectral analysis; speech recognition; 2D smoothing; AURORA2 database; SNS; controlled noise subtraction; noise contamination; noise peak shifting; spectral subtraction series algorithm; speech estimation method; speech power spectrum; speech probability distribution function discontinuity; speech recognition; temporal structure;
fLanguage :
English
Journal_Title :
Signal Processing, IET
Publisher :
IET
ISSN :
1751-9675
Type :
jour
DOI :
10.1049/iet-spr.2012.0357
Filename :
6611359