Title :
SNR estimation based on amplitude modulation analysis with applications to noise suppression
Author :
Tchorz, Jürgen ; Kollmeier, Birger
Author_Institution :
AG Medizinische Phys., Univ. Oldenburg, Germany
fDate :
5/1/2003 12:00:00 AM
Abstract :
A single-microphone noise suppression algorithm is described that is based on a novel approach for the estimation of the signal-to-noise ratio (SNR) in different frequency channels: The input signal is transformed into neurophysiologically-motivated spectro-temporal input features. These patterns are called amplitude modulation spectrograms (AMS), as they contain information of both center frequencies and modulation frequencies within each 32 ms-analysis frame. The different representations of speech and noise in AMS patterns are detected by a neural network, which estimates the present SNR in each frequency channel. Quantitative experiments show a reliable estimation of the SNR for most types of nonspeech background noise. For noise suppression, the frequency bands are attenuated according to the estimated present SNR using a Wiener filter approach. Objective speech quality measures, informal listening tests, and the results of automatic speech recognition experiments indicate a substantial benefit from AMS-based noise suppression, in comparison to unprocessed noisy speech.
Keywords :
amplitude modulation; feature extraction; filtering theory; integral equations; microphones; neural nets; noise abatement; pattern recognition; signal classification; signal representation; spectral analysis; speech processing; AMS-based noise suppression; SNR estimation; Wiener filter; amplitude modulation analysis; amplitude modulation spectrograms; analysis frame; automatic speech recognition; center frequencies; feature extraction; frequency band attenuation; frequency channels; informal listening tests; modulation frequencies; neural network classification; neural network pattern recognition; neurophysiologically-motivated spectro-temporal input features; noise representation; nonspeech background noise; objective speech quality measures; signal-to-noise ratio; single-microphone noise suppression algorithm; speech representation; unprocessed noisy speech; Amplitude estimation; Amplitude modulation; Background noise; Frequency estimation; Frequency modulation; Neural networks; Noise level; Signal to noise ratio; Spectrogram; Speech enhancement;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2003.811542