DocumentCode :
1226695
Title :
SNR estimation based on amplitude modulation analysis with applications to noise suppression
Author :
Tchorz, Jürgen ; Kollmeier, Birger
Author_Institution :
AG Medizinische Phys., Univ. Oldenburg, Germany
Volume :
11
Issue :
3
fYear :
2003
fDate :
5/1/2003 12:00:00 AM
Firstpage :
184
Lastpage :
192
Abstract :
A single-microphone noise suppression algorithm is described that is based on a novel approach for the estimation of the signal-to-noise ratio (SNR) in different frequency channels: The input signal is transformed into neurophysiologically-motivated spectro-temporal input features. These patterns are called amplitude modulation spectrograms (AMS), as they contain information of both center frequencies and modulation frequencies within each 32 ms-analysis frame. The different representations of speech and noise in AMS patterns are detected by a neural network, which estimates the present SNR in each frequency channel. Quantitative experiments show a reliable estimation of the SNR for most types of nonspeech background noise. For noise suppression, the frequency bands are attenuated according to the estimated present SNR using a Wiener filter approach. Objective speech quality measures, informal listening tests, and the results of automatic speech recognition experiments indicate a substantial benefit from AMS-based noise suppression, in comparison to unprocessed noisy speech.
Keywords :
amplitude modulation; feature extraction; filtering theory; integral equations; microphones; neural nets; noise abatement; pattern recognition; signal classification; signal representation; spectral analysis; speech processing; AMS-based noise suppression; SNR estimation; Wiener filter; amplitude modulation analysis; amplitude modulation spectrograms; analysis frame; automatic speech recognition; center frequencies; feature extraction; frequency band attenuation; frequency channels; informal listening tests; modulation frequencies; neural network classification; neural network pattern recognition; neurophysiologically-motivated spectro-temporal input features; noise representation; nonspeech background noise; objective speech quality measures; signal-to-noise ratio; single-microphone noise suppression algorithm; speech representation; unprocessed noisy speech; Amplitude estimation; Amplitude modulation; Background noise; Frequency estimation; Frequency modulation; Neural networks; Noise level; Signal to noise ratio; Spectrogram; Speech enhancement;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2003.811542
Filename :
1208288
Link To Document :
بازگشت